Novel 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, H1983, M1983, 38555 or 593 molecules and uses therefor

ABSTRACT

The invention provides isolated nucleic acids molecules, designated 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 and 593 nucleic acid molecules. The invention also provides antisense nucleic acid molecules, recombinant expression vectors containing 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 and 593 nucleic acid molecules, host cells into which the expression vectors have been introduced, and nonhuman transgenic animals in which a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene has been introduced or disrupted. The invention still further provides isolated 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins, fusion proteins, antigenic peptides and anti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibodies. Diagnostic and therapeutic methods utilizing compositions of the invention are also provided.

RELATED APPLICATIONS

[0001] The present application is a continuation-in-part of U.S. patentapplication Ser. No. 10/278,036, filed Oct. 22, 2002 (pending), which isa continuation of U.S. patent application Ser. No. 09/711,216, filedNov. 9, 2000, which claims the benefit of U.S. Provisional ApplicationSerial No. 60/205,447, filed May 19, 2000. The present application isalso a continuation-in-part of U.S. patent application Ser. No.10/012,055, filed Nov. 13, 2001 (pending), which claims the benefit ofU.S. Provisional Application Serial No. 60/248,325, filed Nov. 14, 2000.The present application is also a continuation-in-part of U.S. patentapplication Ser. No. 10/003,690, filed Nov. 15, 2001 (pending), whichclaims the benefit of U.S. Provisional Application Serial No.60/248,893, filed Nov. 15, 2000. The present application is also acontinuation-in-part of U.S. patent application Ser. No. 09/797,039,filed Feb. 28, 2001 (pending), which claims the benefit of U.S.Provisional Application Serial No. 60/186,061, filed Feb. 29, 2000. Thepresent application is also a continuation-in-part of U.S. patentapplication Ser. No. 10/217,168, filed Aug. 12, 2002 (pending), whichclaims the benefit of U.S. Provisional Application Serial No.60/312,539, filed Aug. 15, 2001. The present application is also acontinuation-in-part of U.S. patent application Ser. No. 09/929,218,filed Aug. 14, 2001 (pending), which claims the benefit of U.S.Provisional Application Serial No. 60/257,511, filed Dec. 22, 2000. Thepresent application is also a continuation-in-part of U.S. patentapplication Ser. No. 09/963,159, filed Sep. 25, 2001 (pending), whichclaims the benefit of U.S. Provisional Application Serial No.60/234,922, filed Sep. 25, 2000. The present application is also acontinuation-in-part of U.S. patent application Ser. No. 10/121,911,filed Apr. 12, 2002 (pending), which is a divisional of U.S. patentapplication Ser. No. 09/412,210, filed Oct. 5, 1999, now U.S. Pat. No.6,403,358. The present application is also a continuation-in-part ofU.S. patent application Ser. No. 10/105,989, filed Mar. 25, 2002(pending), which is a continuation of U.S. patent application Ser. No.09/392,189, filed Sep. 9, 1999. The present application is also acontinuation-in-part of U.S. patent application Ser. No. 10/336,153,filed Jan. 3, 2003 (pending), which is a continuation of U.S. patentapplication Ser. No. 09/845,044, filed Apr. 27, 2001, which claims thebenefit of U.S. Provisional Application Serial No. 60/200,688, filedApr. 28, 2000. The present application is also a continuation-in-part ofU.S. patent application Ser. No. 09/928,531, filed Aug. 13, 2001(pending), which claims the benefit of U.S. Provisional ApplicationSerial No. 60/235,035, filed Sep. 25, 2000. The present application isalso a continuation-in-part of U.S. patent application Ser. No.09/920,346, filed Jul. 31, 2001 (pending), which claims the benefit ofU.S. Provisional Application Serial No. 60/221,925, filed Jul. 31, 2000.The present application is also a continuation-in-part of U.S. patentapplication Ser. No. 10/008,016, filed Nov. 8, 2001 (pending), whichclaims the benefit of U.S. Provisional Application Serial No.60/260,166, filed Jan. 5, 2001 and of U.S. Provisional ApplicationSerial No. 60/246,669, filed Nov. 8, 2000. The present application isalso a continuation-in-part of U.S. patent application Ser. No.09/909,743, filed Jul. 20, 2001 (pending), which is a divisional of U.S.patent application Ser. No. 09/448,076, filed Nov. 23, 1999, now U.S.Pat. No. 6,300,092, which is a continuation-in-part of U.S. patentapplication Ser. No. 09/276,400, filed Mar. 25, 1999, now U.S. Pat. No.6,140,056, which claims the benefit of U.S. Provisional ApplicationSerial No. 60/117,580, filed Jan. 27, 1999. The present application isalso a continuation-in-part of U.S. patent application Ser. No.10/336,489, filed Jan. 2, 2003 (pending), which is a continuation ofU.S. patent application Ser. No. 09/608,921, filed Jun. 30, 2000, whichis a continuation-in-part of U.S. patent application Ser. No.09/163,821, filed Sep. 30, 1998. The present application is also acontinuation-in-part of U.S. patent application Ser. No. 10/060,763,filed Jan. 30, 2002 (pending), which is a continuation of U.S. patentapplication Ser. No. 09/365,162, filed Jul. 30, 1999. The entirecontents of each of the above-referenced patent applications areincorporated herein by this reference.

BACKGROUND OF THE INVENTION

[0002] The enormous variety of biochemical reactions that comprise lifeare nearly all mediated by a series of biological catalysts known asenzymes. Enzymes are proteins which possess specific catalyticactivities that enable them to catalyze a series of reactions, henceenabling metabolic pathways to degrade and to reconstruct productsneeded to maintain organisms. By the binding of substrates throughgeometrically and physically complementary reactions, enzymes arestereospecific in binding substrates as well as in catalyzing reactions.The stringency for this stereospecificity varies as some enzymes aremore specific to the identity of their substrates, while others arecapable of binding multiple substrates and can catalyze numerous typesof reactions.

[0003] Examples of enzymes include, for example, guanylate kinases,phophatidylinositol 4-phosphate 5-kinases, kinases, transferases,aminopeptidases, adenylate cyclases, calpain proteases, oxidoreductases,neprilysin proteases, AMP binding enzymes and lysyl oxidases. Suchenzymes have the ability to, for example: (1) modulate ATP-dependentphosphorylation of GMP, dGMP, or cGMP; (2) catalyze the formation ofphosphoinositol-4,5-bisphosphate via the phosphorylation ofphosphatidylinositol-4-phosphate; (3) mediate the phosphoinositidesignaling cascade; (4) convert a substrate or target molecule to aproduct (e.g., transfer of a phosphate group to a substrate or targetmolecule, or conversion of ATP to ADP); (5) interact with and/orphosphate transfer to a second protein; (6) modulate intra- orintercellular signaling and/or gene transcription (e.g., either directlyor indirectly); (7) modulate the phosphorylation state of targetmolecules (e.g., a kinase or a phosphatase molecule) or thephosphorylation state of one or more proteins involved in cellulargrowth, metabolism, or differentiation, e.g., cardiac, epithelial, orneuronal cell growth or differentiation; (8) convert a substrate ortarget molecule to a product (e.g., transfer of a methyl group to orfrom the substrate or target molecule); (9) interact with and/or methyltransfer to a second target molecule e.g., a nucleic acid molecule(e.g., DNA or RNA), a small organic molecule (e.g., a hormone,neurotransmitter or a coenzyme) or a protein; (10) cleave a proteinprecursor to maturation; (11) catalyze protein degradation; (12)catalyze the formation of a covalent bond within or between an aminoacid residue (e.g., a serine or threonine residue) and a phosphatemoiety; (13) modulate the cAMP signal transduction pathway; (14)modulate a target cell's cAMP concentration; (15) modulatecAMP-dependent protein kinase activity, such as protein kinase A; (16)modulate a calpain protease response; (17) modulate metabolism andcatabolism of biochemical molecules, e.g., molecules necessary forenergy production or storage; (18) modulate betaine synthesis fromcholine; (19) modulate methionine synthesis from homocysteine; (20)modulate the activity of a bioactive peptide, (21) cleave a neprilysinsubstrate, e.g., enkephalin; (22) modulate membrane excitability, (23)influence the resting potential of membranes; (24) modulate acetyl-CoAligase activity; (25) promote activation of acetate; (26) promoteacetate utilization; (27) enhance uptake of acetate into fatty acids andbiochemical products made from fatty acids (e.g., lipids and hormonessuch as sterol hormones); (28) crosslink an extracellular matrixcomponent; (29) regulate bone resorption and/or metabolism; and (30)regulate copper metabolism. Accordingly, there exists a need to identifyadditional human enzymes, for example, for use as disease markers and astargets for identifying various therapeutic modulators.

SUMMARY OF THE INVENTION

[0004] The present invention is based, at least in part, on thediscovery of novel nucleic acid molecules and proteins encoded by suchnucleic acid molecules, referred to herein as “21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593”. The 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acid andprotein molecules of the present invention are useful as modulatingagents in regulating a variety of cellular processes, e.g., includingcell proliferation, differentiation, growth and division. In particular,these nucleic acid molecules will be advantageous in the regulation ofany cellular function, uncontrolled proliferation and differentiation,such as in cases of cancer. Accordingly, in one aspect, this inventionprovides isolated nucleic acid molecules encoding 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins or biologicallyactive portions thereof, as well as nucleic acid fragments suitable asprimers or hybridization probes for the detection of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-encoding nucleicacids.

[0005] The nucleotide sequence of the cDNA encoding 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593, and the amino acidsequence of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptides are depicted in Table 1. TABLE 1 Sequences of theinvention cDNA Protein Coding Region Gene Name (SEQ ID NO:) (SEQ ID NO:)(SEQ ID NO:) 21910 SEQ ID NO: 1 SEQ ID NO: 2 SEQ ID NO: 3 56634 SEQ IDNO: 5 SEQ ID NO: 6 SEQ ID NO: 7 55053 SEQ ID NO: 10 SEQ ID NO: 11 SEQ IDNO: 12 2504 SEQ ID NO: 18 SEQ ID NO: 19 SEQ ID NO: 20 15977 SEQ ID NO:21 SEQ ID NO: 22 SEQ ID NO: 23 14760 SEQ ID NO: 24 SEQ ID NO: 25 SEQ IDNO: 26 25501 SEQ ID NO: 31 SEQ ID NO: 32 SEQ ID NO: 33 17903 SEQ ID NO:39 SEQ ID NO: 40 SEQ ID NO: 41  3700 SEQ ID NO: 43 SEQ ID NO: 44 SEQ IDNO: 45 21529 SEQ ID NO: 46 SEQ ID NO: 47 SEQ ID NO: 48 26176 SEQ ID NO:49 SEQ ID NO: 50 SEQ ID NO: 51 26343 SEQ ID NO: 54 SEQ ID NO: 55 SEQ IDNO: 56 56638 SEQ ID NO: 57 SEQ ID NO: 58 SEQ ID NO: 59 18610 SEQ ID NO:63 SEQ ID NO: 64 SEQ ID NO: 65 33217 SEQ ID NO: 66 SEQ ID NO: 67 SEQ IDNO: 68 21967 SEQ ID NO: 71 SEQ ID NO: 72 SEQ ID NO: 73 h1983 SEQ ID NO:88 SEQ ID NO: 89 SEQ ID NO: 90 m1983 SEQ ID NO: 104 SEQ ID NO: 105 SEQID NO: 106 38555 SEQ ID NO: 107 SEQ ID NO: 108 SEQ ID NO: 109  593 SEQID NO: 111 SEQ ID NO: 112 SEQ ID NO: 113

[0006] Accordingly, in one aspect, the invention features a nucleic acidmolecule which encodes a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein or polypeptide, e.g., a biologically activeportion of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein. In a preferred embodiment, the isolated nucleicacid molecule encodes a polypeptide having the amino acid sequence ofSEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72,89, 105, 108 or 112. In other embodiments, the invention providesisolated 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleic acid molecules having the nucleotide sequence shown in SEQID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43,45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104,106, 107, 109, 111 or 113 or the nucleotide sequence of the DNA insertof the plasmid deposited with ATCC Accession Number ______. In stillother embodiments, the invention provides nucleic acid molecules thatare substantially identical (e.g., naturally occurring allelic variants)to the nucleotide sequence shown in SEQ ID NO:1, 3, 5, 7, 10, 12, 18,20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57,59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113 orthe nucleotide sequence of the DNA insert of the plasmid deposited withATCC Accession Number ______. In other embodiments, the inventionprovides a nucleic acid molecule which hybridizes under a stringenthybridization condition as described herein to a nucleic acid moleculecomprising the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 10, 12, 18,20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57,59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113 orthe nucleotide sequence of the DNA insert of the plasmid deposited withATCC Accession Number ______, wherein the nucleic acid encodes a fulllength 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein or an active fragment thereof.

[0007] In a related aspect, the invention further provides nucleic acidconstructs which include a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 nucleic acid molecule described herein. Incertain embodiments, the nucleic acid molecules of the invention areoperatively linked to native or heterologous regulatory sequences. Alsoincluded are vectors and host cells containing the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acid moleculesof the invention e.g., vectors and host cells suitable for producingpolypeptides.

[0008] In another related aspect, the invention provides nucleic acidfragments suitable as primers or hybridization probes for the detectionof 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-encoding nucleic acids.

[0009] In still another related aspect, isolated nucleic acid moleculesthat are antisense to a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 encoding nucleic acid molecule are provided.

[0010] In another aspect, the invention features 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptides, andbiologically active or antigenic fragments thereof that are useful,e.g., as reagents or targets in assays applicable to treatment anddiagnosis of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593-associated disorders. In another embodiment, the inventionprovides 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptides having a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity.

[0011] In other embodiments, the invention provides 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptides, e.g., a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide having the amino acid sequence shown in SEQ ID NO:2, 6, 11,19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112or the amino acid sequence encoded by the cDNA insert of the plasmiddeposited with ATCC Accession Number ______; an amino acid sequence thatis substantially identical to the amino acid sequence shown in SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112 or the amino acid sequence encoded by the cDNA insert ofthe plasmid deposited with ATCC Accession Number ______; or an aminoacid sequence encoded by a nucleic acid molecule having a nucleotidesequence which hybridizes under a stringent hybridization condition asdescribed herein to a nucleic acid molecule comprising the nucleotidesequence of SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31,33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71,73, 88, 90, 104, 106, 107, 109, 111 or 113 or the nucleotide sequence ofthe insert of the plasmid deposited with ATCC Accession Number ______,wherein the nucleic acid encodes a full length 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein or an activefragment thereof.

[0012] In a related aspect, the invention further provides nucleic acidconstructs which include a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 nucleic acid molecule described herein.

[0013] In a related aspect, the invention provides 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptides orfragments operatively linked to non-21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptides to form fusion proteins.

[0014] In another aspect, the invention features antibodies andantigen-binding fragments thereof, that react with, or more preferablyspecifically or selectively bind 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptides.

[0015] In another aspect, the invention provides methods of screeningfor compounds that modulate the expression or activity of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptides or nucleic acids.

[0016] In still another aspect, the invention provides a process formodulating 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptide or nucleic acid expression or activity, e.g., using thecompounds identified in the screens described herein. In certainembodiments, the methods involve treatment of conditions related toaberrant activity or expression of the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptides or nucleic acids, such asconditions or disorders involving aberrant or deficient 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 expression.Examples of such disorders include, but are not limited to cellularproliferative and/or differentiative disorders, brain disorders,platelet disorders, breast disorders, colon disorders, kidney (renal)disorders, lung disorders, ovarian disorders, prostate disorders,cervical disorders, spleen disorders, thymus disorders, thyroiddisorders, testis disorders, hematopoeitic disorders, pancreaticdisorders, skeletal muscle disorders, skin (dermal) disorders, disordersassociated with bone metabolism, immune, e.g., inflammatory, disorders,cardiovascular disorders, endothelial cell disorders, liver disorders,viral diseases, pain disorders, metabolic disorders, neurological or CNSdisorders, erythroid disorders, blood vessel disorders or angiogenicdisorders.

[0017] The invention also provides assays for determining the activityof or the presence or absence of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptides or nucleic acid moleculesin a biological sample, including for disease diagnosis.

[0018] In a further aspect, the invention provides assays fordetermining the presence or absence of a genetic alteration in a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide or nucleic acid molecule, including for disease diagnosis.

[0019] In another aspect, the invention features a two dimensional arrayhaving a plurality of addresses, each address of the plurality beingpositionally distinguishable from each other address of the plurality,and each address of the plurality having a unique capture probe, e.g., anucleic acid or peptide sequence. At least one address of the pluralityhas a capture probe that recognizes a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 molecule. In one embodiment, thecapture probe is a nucleic acid, e.g., a probe complementary to a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleicacid sequence. In another embodiment, the capture probe is apolypeptide, e.g., an antibody specific for 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 polypeptides. Also featured isa method of analyzing a sample by contacting the sample to theaforementioned array and detecting binding of the sample to the array.

[0020] Other features and advantages of the invention will be apparentfrom the following detailed description, and from the claims.

DETAILED DESCRIPTION OF THE INVENTION

[0021] Human 21910

[0022] The present invention is based, at least in part, on thediscovery of novel molecules, referred to herein as “membrane-associatedguanylate kinase”, “MAGK” or “21910” nucleic acid and protein molecules.Guanylate kinase molecules are novel members of a family of enzymespossessing kinase activity. Guanylate kinases are essential enzymes innucleotide metabolism pathways catalyzing the ATP-dependentphosphorylation of either GMP to GDP or dGMP to dGDP. Guanyate kinasemolecules also function in the recovery of cGMP (cGMP→GMP→GDP→GTP→cGMP)thereby serving to regulate the supply of guanine nucleotides to signaltransduction pathway components (Brady et al. (1996) J. Biol. Chem.271(28):16734-40; Kumar, et al. (2000) Eur. J. Biochem. 267(2):606).Guanylate kinases are essential to a wide range of cellular processesincluding but not limited to nucleotide metabolic processes (e.g.,supplying the building blocks for nucleic acids), phototransductionprocesses (e.g., regulating the opening and/or closing of cGMPgated-channels), cellular growth and proliferation, and signalingpathways (Fitzgibbon, et al (1996) FEBS Letters 385:185-188).

[0023] Membrane-bound forms of guanylate kinase molecules have also beendiscovered. Members of the membrane-associated guanylate kinase familyinteract with the cytoskeleton of the cell and regulate cellproliferation, signaling pathways, and intercellular junctions. (Kim, etal. (1996) Genomics 31(2):223). These molecules participate in theassembly of multiprotein complexes on the inner surface of the plasmamembrane and cluster ion channels, receptors, adhesion molecules andcytosolic signaling proteins at synapses, cellular junctions, andpolarized membrane domains (Fannin and Anderson (1999) Curr. Opin. CellBiol. 11(4):432; Dobrosotskaya, et al. (1997) J. Biol. Chem.272(50):31589). In addition, membrane-associated guanylate kinases haverecently been found to have a transcriptional regulatory function(Hsueh, et al. (2000) Nature 404(6775):298). Typically, these moleculescontain multiple protein-protein interaction motifs including a PDZdomain in the N-terminal portion of the protein, followed by a SH3domain, followed by a guanylate kinase domain at the C-terminus(Dobrosotskaya, et al., supra). Membrane-associated guanylate kinaseshave been found to be localized to tight junctions in epithelial cellmembranes and more notably in neuronal cells (Wu, et al. (2000) Proc.Natl. Acad. Sci. USA 97(8):4233); Hsuesh, supra).

[0024] In humans, guanylate kinases are used as targets for cancerchemotherapy and have been found to be inhibited by the antitumor drug,6-thioguanine. In addition, guanylate kinase activity is required forthe activation of antiviral drugs such as acyclovir and ganciclovir invirus-infected cells (Brady et al., supra).

[0025] Members of the guanylate kinase family have been identified inmany organisms, including E. coli, yeast, mouse, and human. Greaterconservation has been found between mammalian guanylate kinases thanbetween mammalian and yeast or E. coli. However, the overall structureof the molecule is conserved, including conservation of a “giant anionhole” active site which functions to bind nucleoside triphosphates(Brady et al., supra; Stehle and Schulz (1992) J. Mol. Biol. 224(4):1127).

[0026] The MAGK molecules of the present invention, through associationwith cell surface signaling complexes involved in cellular growth andproliferation, may play a role in the modulation of cellular growthsignaling mechanisms. As used herein, the terms “cellular growthsignaling mechanisms,” “cell signaling,” or “cell growth signaling”includes signal transmission from a cell surface signaling complex whichregulates, for example, 1) cell transversal through the cell cycle, 2)cell differentiation, 3) cell survival, and/or 4) cell migration.

[0027] In a preferred embodiment, the MAGK molecules of the presentinvention are involved in metabolic processes of the cell and in themodulation of cellular growth signaling mechanisms. Thus, the MAGKmolecules may modulate cellular growth, differentiation, or migration,and may play a role in disorders characterized by aberrantly regulatedgrowth, proliferation, differentiation, or migration. Accordingly, inone aspect, the present invention provides methods and compositions forthe diagnosis and treatment of a cellular growth or proliferationdisease or disorder, e.g., cancer, including, but not limited to, lungcancer and colon cancer.

[0028] The term “treatment” as used herein, is defined as theapplication or administration of a therapeutic agent to a patient, orapplication or administration of a therapeutic agent to an isolatedtissue or cell line from a patient, who has a disease, a symptom ofdisease or a predisposition toward a disease, with the purpose to cure,heal, alleviate, relieve, alter, remedy, ameliorate, improve or affectthe disease, the symptoms of disease or the predisposition towarddisease. A therapeutic agent includes, but is not limited to, smallmolecules, peptides, antibodies, ribozymes and antisenseoligonucleotides.

[0029] A “cellular growth or proliferation disease or disorder” includesthose diseases or disorders that affect cell growth or proliferationprocesses. As used herein, a “cellular growth or proliferation process”is a process by which a cell increases in number, size or content, bywhich a cell develops a specialized set of characteristics which differfrom that of other cells, or by which a cell moves closer to or furtherfrom a particular location or stimulus. Such disorders include, but arenot limited to, cancer, e.g., carcinoma, sarcoma, or leukemia, examplesof which include, but are not limited to, colon, lung, liver, ovary, andbreast; tumorigenesis and metastasis; skeletal dysplasia; hepaticdisorders; and hematopoietic and/or myeloproliferative disorders.

[0030] The novel MAGK molecules of the present invention have increasedexpression in tumor cells, e.g., lung tumor cells and colon tumor cells,as compared to normal lung and colon cells. Increased expression of MAGKin tumor cells results in an increase in cell growth signaling, therebyincreasing the cellular growth and proliferation of tumor cells.Accordingly, the MAGK molecules of the present invention provide noveldiagnostic targets and therapeutic agents to control MAGK-relateddisorders, e.g., cellular growth or proliferation diseases or disorders,e.g., cancer, including, but not limited to colon cancer or lung cancer.Accordingly, the present invention further provides methods foridentifying the presence of a MAGK nucleic acid or polypeptide moleculeassociated with a cellular growth or proliferation disease or disorder.In addition, the invention provides methods for identifying a subject atrisk for a cellular growth or proliferation disease or disorder, bydetecting the presence of a MAGK nucleic acid or polypeptide molecule,or by detecting aberrant or abnormal MAGK expression or activity.

[0031] The invention also provides a method for identifying a compoundcapable of treating a cellular growth or proliferation disease ordisorder, characterized by aberrant MAGK nucleic acid expression or MAGKprotein activity by assaying the ability of the compound to modulate theexpression of a MAGK nucleic acid or the activity of a MAGK protein.Furthermore, the invention provides a method for treating a subjecthaving a cellular growth or proliferation disease or disordercharacterized by aberrant MAGK protein activity or aberrant MAGK nucleicacid expression by administering to the subject a MAGK modulator whichis capable of modulating MAGK protein activity or MAGK nucleic acidexpression.

[0032] Moreover, the invention provides a method for identifying acompound capable of modulating cellular growth and/or proliferation andcellular signaling by modulating the expression of a MAGK nucleic acidor the activity of a MAGK protein. The invention provides a method formodulating cellular growth and/or proliferation and cellular signalingcomprising contacting an endothelial cell with a MAGK modulator.

[0033] The present invention is directed to novel members of theguanylate kinase family of enzymes, e.g. the MAGK proteins, biologicallyactive fragments thereof, homologues thereof, and/or nucleic acidmolecules encoding such proteins, homologues and/or biologically activefragments, and the use thereof for treating and/or diagnosing a cellulargrowth or proliferation disease or disorder. The term “family” whenreferring to the protein and nucleic acid molecules of the invention isintended to mean two or more proteins or nucleic acid molecules having acommon structural domain or motif and having sufficient amino acid ornucleotide sequence homology as defined herein. Such family members canbe naturally or non-naturally occurring and can be from either the sameor different species. For example, a family can contain a first proteinof human origin, as well as other, distinct proteins of human origin oralternatively, can contain homologues of non-human origin, e.g., mouseor monkey proteins. Members of a family may also have common functionalcharacteristics.

[0034] Accordingly, in one embodiment, a MAGK molecule of the presentinvention is identified based on the presence of a “ATP/GTP-binding sitemotif A (P-loop)” in the protein or corresponding nucleic acid molecule.As used herein, the term “ATP/GTP-binding site motif A (P-loop)”includes a protein motif having an amino acid sequence of about 8 aminoacid residues. Preferably, a P-loop has about 5-8 residues and thefollowing consensus sequence: [AG]-X(4)-G-K-[ST] (SEQ ID NO:4) (SarasteM., Sibbald P. R., Wittinghofer A. (1990) Trends Biochem. Sci.15:430-434). To identify the presence of a ATP/GTP-binding site motif A(P-loop) in a MAGK protein, and make the determination that a protein ofinterest has a particular motif, the amino acid sequence of the proteinmay be searched against a database of known protein motifs (e.g., theProSite database). The ATP/GTP-binding site motif A (P-loop) has beenassigned ProSite accession number PS00017. A search was performedagainst the ProSite database resulting in the identification of aATP/GTP-binding site motif A (P-loop) in the amino acid sequence ofhuman MAGK (SEQ ID NO:2) at about residues 404-411 of SEQ ID NO:2.

[0035] In another embodiment, a MAGK molecule of the present inventionis identified based on the presence of a “guanylate kinase domain” inthe protein or corresponding nucleic acid molecule. As used herein, theterm “guanylate kinase domain” includes a protein domain having an aminoacid sequence of about 50-200 amino acid residues and a bit score ofabout 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180,190, 200, 210, or 220 or more. Preferably, a guanylate kinase domainincludes at least about 100-200, or more preferably about 109 amino acidresidues, and a bit score of at least 139.4. To identify the presence ofa guanylate kinase domain in a MAGK protein, and make the determinationthat a protein of interest has a particular profile, the amino acidsequence of the protein may be searched against a database of knownprotein domains (e.g., the HMM database). A search was performed againstthe HMM database resulting in the identification of a guanylate kinasedomain in the amino acid sequence of human MAGK (SEQ ID NO:2) at aboutresidues 515-624 of SEQ ID NO:2.

[0036] A guanylate kinase domain can further be characterized based onthe presence of a guanylate kinase consensus sequence in the protein orcorresponding nucleic acid molecule. As used herein, the term “guanylatekinase domain” includes a protein motif having an amino acid sequence ofabout 18 amino acid residues. Preferably, a guanylate kinase domain hasabout 15-20 residues. To identify the presence of a guanylate kinasedomain in a MAGK protein, and make the determination that a protein ofinterest has a particular motif, the amino acid sequence of the proteinmay be searched against a database of known protein motifs (e.g., theProSite database). The guanylate kinase domain has been assigned ProSiteaccession number. PS00856. A search was performed against the ProSitedatabase resulting in the identification of a guanylate kinase domain inthe amino acid sequence of human MAGK (SEQ ID NO:2) at about residues514-531 of SEQ ID NO:2.

[0037] In another embodiment, a MAGK molecule of the present inventionis identified based on the presence of a “PDZ domain” in the protein orcorresponding nucleic acid molecule. As used herein, the term “PDZdomain” includes a protein domain having an amino acid sequence of about50-200 amino acid residues and a bit score of about 20, 30, 40, 50, 60,70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190 or 200 ormore. Preferably, a PDZ domain includes at least about 50-150, or morepreferably about 79 amino acid residues, and a bit score of at least52.4. To identify the presence of a PDZ domain in a MAGK protein, andmake the determination that a protein of interest has a particularprofile, the amino acid sequence of the protein may be searched againsta database of known protein domains (e.g., the HMM database). A searchwas performed against the HMM database resulting in the identificationof a PDZ domain in the amino acid sequence of human MAGK (SEQ ID NO:2)at about residues 256-335 of SEQ ID NO:2.

[0038] In another embodiment, a MAGK molecule of the present inventionis identified based on the presence of a “SH3 domain” in the protein orcorresponding nucleic acid molecule. As used herein, the term “SH3domain” includes a protein domain having an amino acid sequence of about50-150 amino acid residues and a bit score of about 5, 10, 20, 30, 40,50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150 or more. Preferably, aSH3 domain includes at least about 50-100, or more preferably about 67amino acid residues, and a bit score of at least 5.2. To identify thepresence of a SH3 domain in a MAGK protein, and make the determinationthat a protein of interest has a particular profile, the amino acidsequence of the protein may be searched against a database of knownprotein domains (e.g., the HMM database). A search was performed againstthe HMM database resulting in the identification of a SH3 domain in theamino acid sequence of human MAGK (SEQ ID NO:2) at about residues348-415 of SEQ ID NO:2.

[0039] In a preferred embodiment, the MAGK molecules of the inventioninclude at least one, preferably two, more preferably three or more ormore of the following domains: an ATP/GTP-binding site motif A (P-loop),a guanylate kinase domain, a PDZ domain, and a SH3 domain.

[0040] In yet another embodiment, isolated proteins of the presentinvention, preferably MAGK proteins, have an amino acid sequencesufficiently identical to the amino acid sequence of SEQ ID NO:2, or areencoded by a nucleotide sequence sufficiently identical to SEQ ID NO:1or 3. As used herein, the term “sufficiently identical” refers to afirst amino acid or nucleotide sequence which contains a sufficient orminimum number of identical or equivalent (e.g., an amino acid residuewhich has a similar side chain) amino acid residues or nucleotides to asecond amino acid or nucleotide sequence such that the first and secondamino acid or nucleotide sequences share common structural domains ormotifs and/or a common functional activity. For example, amino acid ornucleotide sequences which share common structural domains have at least30%, 40%, or 50% homology, preferably 60% homology, more preferably70%-80%, and even more preferably 90-95% homology across the amino acidsequences of the domains and contain at least one and preferably twostructural domains or motifs, are defined herein as sufficientlyidentical. Furthermore, amino acid or nucleotide sequences which shareat least 30%, 40%, or 50%, preferably 60%, more preferably 70-80%, or90-95% homology and share a common functional activity are definedherein as sufficiently identical.

[0041] As used interchangeably herein, an “MAGK activity”, “biologicalactivity of MAGK,” or “functional activity of MAGK,” refers to anactivity exerted by a MAGK protein, polypeptide or nucleic acid moleculeon a MAGK responsive cell or tissue, or on a MAGK protein substrate, asdetermined in vivo, or in vitro, according to standard techniques. Asused herein, a “membrane-associated guanylate kinase activity” includesATP-dependent phosphorylation of GMP (or dGMP) into GDP (or dGDP)involved, for example, in the production of molecules necessary forsignal transduction, cell signaling, cellular growth, cellularproliferation, and the like. In one embodiment, a MAGK activity is adirect activity, such as an association with a MAGK-target molecule. Asused herein, a “target molecule” or “binding partner” is a molecule withwhich a MAGK protein binds or interacts in nature, such thatMAGK-mediated function is achieved, e.g., modulation of cellularsignaling, growth, and/or proliferation. A MAGK target molecule can be anon-MAGK molecule or a MAGK protein or polypeptide of the presentinvention (e.g., ATP). In an exemplary embodiment, a MAGK targetmolecule is a MAGK ligand (e.g., GMP, dGMP). Alternatively, a MAGKactivity is an indirect activity, such as a cellular signaling activitymediated by interaction of the MAGK protein with a MAGK ligand. Thebiological activities of MAGK are described herein. For example, theMAGK proteins of the present invention can have one or more of thefollowing activities: i) interaction of a MAGK protein molecule with anon-MAGK protein molecule (e.g. GMP, ATP), ii) modification of a MAGKsubstrate (e.g. GMP or dGMP), iii) assembly of protein complexes atcell-junctions, iv) interaction with the cellular cytoskeleton, and v)interaction between a membrane-bound MAGK protein and a non-MAGKprotein. In yet another preferred embodiment, a MAGK activity is atleast one or more of the following activities: 1) modulation ofATP-dependent phosphorylation of GMP, dGMP, or cGMP 2) modulation ofcellular signal transduction, 3) modulation of metabolism or catabolismof metabolically important biomolecules (e.g., nucleotides), 4)modulation of cellular growth and differentiation, 5) modulation ofcellular proliferation, a 6) modulation of cell signaling mechanisms,e.g., cellular growth signaling mechanisms, 7) modulation ofintercellular junctions, 8) modulation of transcription, and 9)modulation of paracellular pathways.

[0042] Accordingly, another embodiment of the invention featuresisolated MAGK proteins and polypeptides having a MAGK activity. Otherpreferred proteins are MAGK proteins having one or more of the followingdomains: an ATP/GTP-binding site motif A (P-loop), a guanylate kinasedomain, a PDZ domain, a SH3 domain, and, preferably, a MAGK activity.

[0043] Additional preferred proteins have one or more of the followingdomains: an ATP/GTP-binding site motif A (P-loop), a guanylate kinasedomain, a PDZ domain, a SH3 domain, and are, preferably, encoded by anucleic acid molecule having a nucleotide sequence which hybridizesunder stringent hybridization conditions to a complement of a nucleicacid molecule comprising the nucleotide sequence of SEQ ID NO:1 or 3.

[0044] The nucleotide sequence of the isolated human MAGK cDNA and thepredicted amino acid sequence of the human MAGK polypeptide are shown inSEQ ID NOs:1 and 2, respectively. A plasmid containing the nucleotidesequence encoding human MAGK, was deposited with the American TypeCulture Collection (ATCC), 10801 University Boulevard, Manassas, Va.20110-2209, on ______ and assigned Accession Numbers ______. Thisdeposit will be maintained under the terms of the Budapest Treaty on theInternational Recognition of the Deposit of Microorganisms for thePurposes of Patent Procedure. This deposit was made merely as aconvenience for those of skill in the art and is not an admission that adeposit is required under 35 U.S.C. §112.

[0045] Isolation of the 21910 or “MAGK” cDNA

[0046] The invention is based, at least in part, on the discovery of ahuman gene encoding a novel protein, referred to herein as 21910 orMAGK. The entire sequence of human clone Fbh21910 was determined andfound to contain an open reading frame termed human “21910” or “MAGK”,set forth in SEQ ID NO:1 and 3. The 74.36 kD MAGK protein comprisesabout 675 amino acids and is shown in SEQ ID NO:2. The coding region(open reading frame) of SEQ ID NO:1, is set forth as SEQ ID NO:3. CloneFbh21910, comprising the coding region of human MAGK, was deposited withthe American Type Culture Collection (ATCC®), 10801 UniversityBoulevard, Manassas, Va. 20110-2209, on ______, and assigned AccessionNo. ______.

[0047] Analysis of the Human 21910 or MAGK Molecule

[0048] The amino acid sequence of human MAGK was analyzed using theprogram PSORT to predict the localization of the protein within thecell. This program assesses the presence of different targeting andlocalization amino acid sequences within the query sequence. The resultsof the analysis predict that human MAGK (SEQ ID NO:2) is intracellular(e.g. nuclear, cytoplasmic, cytoskeletal).

[0049] A search of the amino acid sequence of MAGK was also performedagainst the ProSite database. This search resulted in the identificationof a “ATP/GTP-binding site motif A (P-loop)” in the amino acid sequenceof MAGK (SEQ ID NO:2) at about residues 404-411 and a “guanylate kinasesignature” in the amino acid sequence of MAGK (SEQ ID NO:2) at aboutresidues 514-531. This search also resulted in the identification of apotential N-glycosylation site at about residues 82-85 of SEQ ID NO:2, anumber of potential protein kinase C phosphorylation sites at aboutresidues 84-86, 130-132, 253-255, 270-272, 432-434, 514-516, 517-519,562-564, 569-571, 576-578, 581-583, and 584-586 of SEQ ID NO:2, a numberof potential casein kinase II phosphorylation sites at about residues14-17, 25-28, 97-100, 137-140, 143-146, 383-386, 422-425, 465-468,517-520, 558-561, and 646-649 of SEQ ID NO:2, a tyrosine kinasephosphorylation site at about residues 586-593 of SEQ ID NO:2, a numberof potential N-myristoylation sites at about residues 205-210, 247-2525,and 405-410 of SEQ ID NO:2, and a potential amidation site at aboutresidues 72-76 of SEQ ID NO:2.

[0050] A search of the amino acid sequence of MAGK was also performedagainst the HMM database. This search resulted in the identification ofa “guanylate kinase domain” in the amino acid sequence of MAGK (SEQ IDNO:2) at about residues 515-624 (score=139.4), a “PDZ domain” in theamino acid sequence of MAGK (SEQ ID NO:2) at about residues 256-335(score=52.4), and a “SH3 domain” in the amino acid sequence of MAGK (SEQID NO:2) at about residues 348-415 (score=5.2).

[0051] Other HMM hits of interest that were identified in the HMMdatabase include, for example, a “NAD-dependent DNA ligase domain” atabout residues 529-535 of SEQ ID NO:2 (score=2.3), an “X-Prodipeptidyl-peptidase domain” at about residues 642-658 of SEQ ID NO:2(score=−0.0), and a “caulimovirus movement protein domain” at aboutresidues 420-673 of SEQ ID NO:2 (score=−184.0).

[0052] Tissue Distribution of 21910 or MAGK by In Situ Analysis

[0053] For in situ analysis, various tissues, e.g. tissues obtained fromnormal lung and colon and lung and colon tumors, were first frozen ondry ice.

[0054] In situ hybridization results indicated no expression in 2 normallung samples. By contrast, expression was detected in 2 of 4 lung tumorsamples. Results further indicated no expression in 3 normal tumorsamples and strong expression in 4 of 4 primary colon tumors tested and3 of 3 colon metastases tested. Breast and ovary tissue also showedtumor specific expression.

[0055] Tissue Expression Analysis of 21910 or MAGK mRNA Using TaqMan™Analysis

[0056] This example describes the tissue distribution of human MAGK mRNA(huMAGK) in a variety of cells and tissues, as determined using theTaqMan™ procedure.

[0057] The expression levels of human 21910 or MAGK mRNA in varioushuman cell types and tissues was first determined in an array profilingexperiment comparing the expression of genes in lung tumor cell linesversus normal bronchial epithelium. These experiments demonstrated thatMAGK expression is increased 2-fold in a small cell lung tumor line ascompared to normal epithelium.

[0058] The RNA used in the array profiling experiment was isolated fromthe following cell lines: NHBE (available from Clonetics®) and NC1-H69(available from ATCC®). NHBE cells were grown in BEGM (bronchialepithelium growth) Bulletkit® medium. The cells were grown to 80%confluency in a T175 flask and harvested for RNA by the Qiagen® Midi RNApreparation method. NC1-H69 cells were grown in suspension in T175flasks in RPMI+2% Hyclone FBS, 2 mM L-Glutamine, 10 mM HEPES, and 1/100Gibco® Selenium/Insulin/Transferrin supplement medium. RNA was preparedwith the Qiagen® RNA Midi Kit, as directed by the manufacturer.

[0059] The expression levels of human 21910 or MAGK mRNA in varioushuman cell types and tissues were analyzed in detail in a secondexperiment using the TaqMan™ procedure. As shown in Table 2, the highest21910 or MAGK expression was detected in brain, epithelial cells, andfetal heart. TABLE 2 Expression of Human MAGK Mean huMAGK CT MeanNormalized Tissue Source Value Beta 2 CT Value Expression Aorta/normal35.91 24.30 0.52 Fetal heart/normal 27.07 20.91 22.72 Heart/normal 27.9920.00 6.39 Heart/CHF 29.27 21.82 9.32 Vein/normal 30.94 20.60 1.25Spinal cord/normal 27.43 20.11 10.17 Brain cortex/normal 26.85 22.1763.15 Brain hypothalamus 26.48 21.08 38.47 Glial cells (Astro) 27.6922.54 45.91 Brain/Glioblastoma 28.13 19.46 3.99 Breast/normal 29.0320.52 4.47 Breast tumor/IDC 29.08 19.77 2.56 OVARY/normal 31.24 21.992.67 OVARY/tumor 29.61 20.44 2.82 Pancreas 32.24 25.20 12.34Prostate/normal 28.34 20.32 6.26 Prostate/tumor 27.04 19.23 7.24Colon/normal 27.83 19.13 3.91 Colon/tumor 26.83 19.82 12.60 Colon/IBD29.96 19.39 1.05 Kidney/normal 28.14 21.61 17.58 Liver/normal 29.7620.11 2.02 Liver fibrosis 30.51 21.19 2.54 Fetal liver/normal 30.6222.42 5.54 Lung/normal 28.89 19.04 1.77 Lung/tumor 28.32 19.55 3.73Lung/COPD 28.09 19.19 3.40 Spleen/normal 33.65 21.52 0.36 Tonsil/normal30.00 19.09 0.85 Lymphnode/normal 30.47 19.71 0.94 Thymus/normal 28.2920.49 7.26 Epithelial Cells 27.68 21.46 21.72 Endothelial Cells 30.7722.01 3.73 Skeletal Muscle 29.17 21.74 9.42 Fibroblasts (Dermal) 30.3820.04 1.26 Skin/normal 31.58 22.05 2.20 Adipose/normal 29.83 20.08 1.89Osteoblast (primary) 29.21 21.17 6.19 Osteoblasts (Undiff) 28.89 20.093.64 Osteoblasts (Diff.) 28.59 19.16 2.36 Osteoclasts 30.91 18.58 0.32Aortic SMC Early 28.86 21.39 9.16 Aortic SMC Late 31.23 24.20 12.47Shear HUVE C 28.63 21.41 10.93 Static HUVE C 28.75 21.56 11.16Osteoclast (Undiff.) 32.69 17.97 0.06

[0060] As shown in Table 3, increased expression of human 21910 or MAGKwas detected in 6 of 8 lung tumor samples (T) versus normal lung tissuesamples (N). As shown in Table 4, increased expression of huMAGK wasdetected in 4 of 7 colon tumor samples (T) versus normal colon tissuesamples (N). TABLE 3 Human MAGK Expression in Clinical Lung Samples MeanhuMAGK CT Mean Normalized Tissue Source Value Beta 2 CT Value ExpressionLung N 33.1 22.3 6.2 Lung N 29.3 19.1 9.3 Lung N 24.9 15.2 13.1 Lung N26.9 16.4 7.3 Lung T 24.9 16.3 30.3 Lung T 25.7 17.5 37.2 Lung T 28.117.9 9.2 Lung T 26.6 17.2 16.3 Lung T 26.9 19.2 54.4 Lung T 27.8 19.329.5 Lung T 27.0 17.9 20.1 Lung T 26.4 18.0 31.7

[0061] TABLE 4 Human MAGK Expression in Clinical Colon Samples MeanhuMAGK Mean Beta 2 CT Normalized Tissue Source CT Value Value ExpressionColon N 26.8 16.9 13.7 Colon N 30.4 21.0 18.6 Colon N 27.9 18.1 15.0Colon N 25.7 16.8 27.7 Colon T 24.4 16.3 49.2 Colon T 24.3 17.3 102.6Colon T 25.2 16.2 25.3 Colon T 26.3 17.1 21.4 Colon T 24.4 16.4 49.0Colon T 32.0 23.6 37.7 Colon T 25.5 16.1 19.2 Liver Met 26.4 17.2 21.8Liver Met 29.0 19.6 19.7 Liver Met 28.8 18.1 7.7 Liver Met 29.4 17.8 4.1Liver N 28.9 17.4 4.3 Liver N 31.2 23.0 44.7

[0062] These data reveal a significant up-regulation of MAGK mRNA incolon and lung carcinomas. Given that the mRNA for MAGK is expressed ina variety of tumors, with significant up-regulation in carcinoma samplesin comparison to normal samples, it is believed that inhibition of MAGKactivity may inhibit tumor progression by inhibiting cell growthsignaling and cellular growth and proliferation.

[0063] Human 56634

[0064] The present invention is based, at least in part, on thediscovery of a novel phosphatidylinositol 4-phosphate 5-kinase termed56634. The human 56634 sequence (SEQ ID NO:5), which is approximately3224 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1266nucleotides, including the termination codon. The coding sequenceencodes a 421 amino acid protein (SEQ ID NO:6).

[0065] Human 56634 contains the following regions or other structuralfeatures: a phosphatidylinositol 4-phosphate 5-kinase domain (PFAMAccession Number PF01504) located at about amino acid residues 72 to 421of SEQ ID NO:6; one predicted N-glycosylation site (PS0001) at aboutamino acids 165 to 168 of SEQ ID NO:6; seven predicted Protein Kinase Cphosphorylation sites (PS00005) at about amino acids 28 to 30, 79 to81,208 to 210,229 to 231,239 to 241, 338 to 340, and 391 to 393 of SEQID NO:6; ten predicted Casein Kinase II phosphorylation sites (PS00006)located at about amino 58 to 61, 132 to 135, 155 to 158, 229 to 232, 239to 242, 294 to 297, 307 to 310, 327 to 330, 349 to 352, and 377 to 380of SEQ ID NO:6; one predicted tyrosine kinase phosphorylation sites(PS00007) from about amino acid 114 to 122 of SEQ ID NO:6; and fourpredicted N-myristoylation sites (PS00008) from about amino acid 54 to59, 221 to 226, 323 to 328, and 397 to 402 of SEQ ID NO:6.

[0066] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0067] A plasmid containing the nucleotide sequence encoding human 56634(clone “Fbh56634FL”) was deposited with American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Number ______. This deposit will be maintainedunder the terms of the Budapest Treaty on the International Recognitionof the Deposit of Microorganisms for the Purposes of Patent Procedure.This deposit was made merely as a convenience for those of skill in theart and is not an admission that a deposit is required under 35 U.S.C.§112.

[0068] Signal transduction through phosphoinositol lipids plays animportant role in various cellular processes, including vesicularsecretion, cytoskeletal organization, and cell growth anddifferentiation. The phosphatidylinositol (PI) signal transductionpathway is regulated, in part, by the conversion of PI, a membrane lipidbearing a sugar moiety attached via an intermediate phosphate residue,into singly, doubly, and triply phosphorylated products (Carpenter andCantley (1996) Curr Opin Cell Biol 8:153-158). A crucial step in thepathway occurs when phosphatidylinositol 4-phosphate (PIP) isphosphorylated to become phosphatidylinositol 4,5-bis-phosphate (PIP2),a step catalyzed by phosphatidylinositol 4-phosphate 5-kinase(Boronenkov and Anderson (1995) J Biol Chem 270:2881-2884.). Thehydrolysis of PIP2 by phospholipase C (PLC) produces the secondmessengers diacylglycerol (DAG) and inositol tris-phosphate (IP3). DAGis an activator of protein kinase C (PKC) and IP3 plays an importantrole in the release of intracellular calcium. In addition, PIP2 isconverted into phosphatidylinositol 3,4,5-tris-phosphate, whichactivates some PKC isoforms. Thus, the phosphatidylinositol 4-phosphate5-kinase family of proteins plays an important role in the regulation ofthe phosphoinositide signaling cascade by catalyzing key phosphorylationevents.

[0069] The 56634 protein contains a significant number of structuralcharacteristics in common with members of the phosphatidylinositol4-phosphate 5-kinase (PIP5K) family. The phosphatidylinositol4-phosphate 5-kinase family comprises a number of related enzymes thatshare a common catalytic mechanism. PIP5K catalyses the formation ofphosphoinositol-4,5-bisphosphate via the phosphorylation ofphosphatidylinositol-4-phosphate, a precursor in the phosphinositidesignaling pathway. Phosphatidylinositol 4-phosphate 5-kinase has beenshown to be required for vesicular secretion and trafficking of a widevariety of cells (Hay et al. (1995) Nature 374:173-7; Yamamoto et al.(1995) Mol Biol Cell 6:525-39). In addition, there is evidence thatphosphatidylinositol 4-phosphate 5-kinase is involved in signaltransduction and regulation of the actin cytoskeleton via theinteraction with the Rho family of small G proteins (Chong et al. (1994)Cell 79:507-13; Ren et al. (1996) Mol Biol Cell 7:435-442), suggesting arole in cell movement and metastasis.

[0070] Thus, this (PIP5K) family includes enzymes critical for theproper function of many physiological systems, including vesiclesecretion and trafficking, cell signaling, and cellular proliferationand differentiation.

[0071] A 56634 polypeptide can include a “phosphatidylinositol4-phosphate 5-kinase domain” or regions homologous with a“phosphatidylinositol 4-phosphate 5-kinase domain”.

[0072] As used herein, the term “phosphatidylinositol 4-phosphate5-kinase domain” includes an amino acid sequence of about 200-500 aminoacid residues in length and having a bit score for the alignment of thesequence to the phosphatidylinositol 4-phosphate 5-kinase domain profile(Pfam HMM) of at least 100. Preferably, a phosphatidylinositol4-phosphate 5-kinase domain includes at least about 200 to 500 aminoacids, more preferably about 250 to 450 amino acid residues, or about300 to 400 amino acids and has a bit score for the alignment of thesequence to the phosphatidylinositol 4-phosphate 5-kinase domain (HMM)of at least 100, preferably at least 200, 300, 400 or greater. Thephosphatidylinositol 4-phosphate 5-kinase domain (HMM) has been assignedthe PFAM Accession Number PF01504. The phosphatidylinositol 4-phosphate5-kinase domain (HMM) has been assigned the SMART identifier PIPK_(—)2.An alignment of the phosphatidylinositol 4-phosphate 5-kinase domain(amino acids 72 to 421 of SEQ ID NO:6) of human 56634 with the PIPK_(—)2consensus amino acid sequences derived from a hidden Markov modelderived from SMART yielded a score of 586.8 (E=1.4e−172). The PIPK_(—)2sequence is depicted as SEQ ID NO:9. An alignments of thephosphatidylinositol 4-phosphate 5-kinase domain (amino acids 124 to 420of SEQ ID NO:6) of human 56634 with the PIP5K consensus amino acidsequences derived from a hidden Markov model derived from PFAM yielded ascore of 530.2 (E=1.5e−155). The PIP5K sequence is depicted as SEQ IDNO: 8.

[0073] In a preferred embodiment 56634 polypeptide or protein has a“phosphatidylinositol 4-phosphate 5-kinase domain” or a region whichincludes at least about 200 to 500, more preferably about 250 to 450, or300 to 400 amino acid residues and has at least about 60%, 70% 80% 90%95%, 99%, or 100% homology with a “phosphatidylinositol 4-phosphate5-kin ase,” e.g., the phosphatidylinositol 4-phosphate 5-kinase domainof human 56634 (e.g., residues 72 to 421 of SEQ ID NO:6).

[0074] To identify the presence of a “phosphatidylinositol 4-phosphate5-kinase” domain in a 56634 protein sequence, and make the determinationthat a polypeptide or protein of interest has a particular profile, theamino acid sequence of the protein can be searched against the Pfamdatabase of HMMs (e.g., the Pfam database, release 2.1) using thedefault parameters. For example, the hmmsf program, which is availableas part of the HMMER package of search programs, is a family specificdefault program for MILPAT0063 and a score of 15 is the defaultthreshold score for determining a hit. Alternatively, the thresholdscore for determining a hit can be lowered (e.g., to 8 bits). Adescription of the Pfam database can be found in Sonhammer et al. (1997)Proteins 28(3):405-420 and a detailed description of HMMs can be found,for example, in Gribskov et al. (1990) Meth. Enzymol. 183:146-159;Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA 84:4355-4358; Krogh etal. (1994) J. Mol. Biol. 235:1501-1531; and Stultz et al. (1993) ProteinSci. 2:305-314, the contents of which are incorporated herein byreference. A search was performed against the HMM database resulting inthe identification of a “phosphatidylinositol 4-phosphate 5-kinase”domain in the amino acid sequence of human 56634 at about residues124-420 of SEQ ID NO:6. The sequence of the identified Pfam“phosphatidylinositol 4-phosphate 5-kinase” domain is depicted in SEQ IDNO:8.

[0075] To identify the presence of a “phosphatidylinositol 4-phosphate5-kinase” domain in a 56634 protein sequence, and make the determinationthat a polypeptide or protein of interest has a particular profile, theamino acid sequence of the protein can be searched against a SMARTdatabase (Simple Modular Architecture Research Tool) of HMMs asdescribed in Schultz et al. (1998), Proc. Natl. Acad. Sci. USA 95:5857and Schultz et al. (200) Nucl. Acids Res 28:231. The database containsdomains identified by profiling with the hidden Markov models of theHMMer2 search program (R. Durbin et al. (1998) Biological sequenceanalysis: probabilistic models of proteins and nucleic acids. CambridgeUniversity Press). The database also is extensively annotated andmonitored by experts to enhance accuracy. A search was performed againstthe HMM database resulting in the identification of a“phosphatidylinositol 4-phosphate 5-kinase” domain in the amino acidsequence of human 56634 at about residues 72 to 421 of SEQ ID NO:6. Thesequence of the identified SMART “phosphatidylinositol 4-phosphate5-kinase” domain is depicted in SEQ ID NO:9.

[0076] A 56634 polypeptide can include a “phosphatidylinositol4-phosphate 5-kinase domain” or regions homologous with a“phosphatidylinositol 4-phosphate 5-kinase domain.” A 56634 polypeptidecan optionally further include at least one N-glycosylation site; atleast one, two, three, four, five, six, preferably seven protein kinaseC phosphorylation sites; at least one, two, three, four, five, six,seven, eight, nine, preferably ten, casein kinase II phosphorylationsites; at least one tyrosine kinase phosphorylation site; at least one,two, three, preferably four, N-myristylation sites.

[0077] As the 56634 polypeptides of the invention may modulate56634-mediated activities, they may be useful as of for developing noveldiagnostic and therapeutic agents for 56634-mediated or relateddisorders, e.g., cancer, as described below.

[0078] As used herein, a “56634 activity”, “biological activity of56634” or “functional activity of 56634”, refers to an activity exertedby a 56634 protein, polypeptide or nucleic acid molecule. For example, a56634 activity can be an activity exerted by 56634 in a physiologicalmilieu on, e.g., a 56634-responsive cell or on a 56634 substrate, e.g.,a protein substrate. A 56634 activity can be determined in vivo or invitro. In one embodiment, a 56634 activity is a direct activity, such asan association with a 56634 target molecule. A “target molecule” or“binding partner” is a molecule with which a 56634 protein binds orinteracts in nature. In an exemplary embodiment, 56634 is an enzyme forconverting phosphatidylinositol 4-phosphate (PIP) tophosphatidylinositol 4,5-bis-phosphate (PIP2).

[0079] A 56634 activity can also be an indirect activity, e.g., acellular signaling activity mediated by interaction of the 56634 proteinwith a 56634 receptor. The features of the 56634 molecules of thepresent invention can provide similar biological activities asphosphatidylinositol 4-phosphate 5-kinase family members. For example,the 56634 proteins of the present invention can have one or more of thefollowing activities: (1) catalyses the formation ofphosphoinositol-4,5-bisphosphate via the phosphorylation ofphosphatidylinositol-4-phosphate; (2) mediates the phosphoinositidesignaling cascade; (3) mediates vesicular trafficking; or (4) mediatesorganization of the cytoskeleton. As a result, the 56634 protein mayhave a critical function in one or more of the following physiologicalprocesses: (a) vesicular secretion; (b) phosphoinositide signaling; or(c) cell proliferation and differentiation.

[0080] Several lines of evidence have shown coordinate increases inphosphatidylinositol and PIP kinase activities in human cancer cells,suggesting an increased capacity for signal transduction. Among PIPKs,two major subtypes (types I and II), each comprising two isoforms (Ia,Ib, Ia, IIb), have been identified to date. Type II phosphatidylinositolphosphate kinase (PIPKII) is an enzyme responsible for the synthesis ofphosphatidylinositol-4,5-bisphosphate (PI-4,5-P(2)) fromphosphatidylinositol-5-phosphate (PI-5-P). Mitogenic stimulation, suchas by serum, EGF, and PDGF treatment, results in phosphorylation in vivoof rat PIPKIIg (JBC 273:20292, 1998). In addition, PIPKIIb isoform hasalso been show to interact not only with the EGF receptor, but alsoselectively with other members of the ErbB tyrosine kinase family (CellSignal 11:171, 1999).

[0081] As described below, expression of 56634 is increased after thetreatment of mitogens, including EGF and serum. In addition expressionof 56634 is increased in-many clinical tumor tissues when compared tonormal tissue controls, suggesting an increased capacity for PIP kinasemediated signal transduction. Therefore, inhibition of 56634 may reducethe signaling potential of cancer cells, thereby halting and possiblyreducing the growth of tumor cells. Thus, the 56634 molecules can act asnovel diagnostic targets and therapeutic agents for controllingproliferation and differentiation related disorders.

[0082] Examples of such disorders include cancer, e.g., ovarian, breast,lung or colon cancer. Thus, the 56634 molecules can act as noveldiagnostic targets and therapeutic agents for controlling one or more ofcellular proliferative and/or differentiative disorders.

[0083] Identification and Characterization of Human 56634 cDNA

[0084] The human 56634 sequence (SEQ ID NO:5) is approximately 3224nucleotides long. The nucleic acid sequence includes an initiation codon(ATG) and a termination codon (TAA). The region between and inclusive ofthe initiation codon and the termination codon is a methionine-initiatedcoding sequence of about 1266 nucleotides, including the terminationcodon (nucleotides indicated as “coding” of SEQ ID NO:5; SEQ ID NO:7).The coding sequence encodes a 421 amino acid protein (SEQ ID NO:6).

[0085] Tissue Distribution of 56634 mRNA by TagMan Analysis and In SituHybridization

[0086] Endogenous human 56634 gene expression was determined using thePerkin-Elmer/ABI 7700 Sequence Detection System which employs TaqMantechnology.

[0087] To determine the level of 56634 in various human tissues aprimer/probe set was designed. Total RNA was prepared from a series ofhuman tissues using an RNeasy kit from Qiagen. First strand cDNA wasprepared from 1 μg total RNA using an oligo-dT primer and Superscript IIreverse transcriptase (Gibco/BRL). cDNA obtained from approximately 50ng total RNA was used per TaqMan reaction. Tissues tested include thehuman tissues and several cell lines shown in Tables 5-12, below.

[0088] TaqMan analysis revealed that the expression of 56634 wasincreased with addition of the growth factor EGF to serum free culturemedia of the SKOV3 ovarian cancer cell line for 15, 30 or 60 minutes(Table 5). The expression of 56634 was also similarly increased when thebreast cancer cell line MCF10A was treated with EGF for comparable timepoints (Table 6). 56634 was also shown to be induced in the HEY ovariancell line with the addition of serum following overnight serumstarvation (Table 7). When normal human ovarian epithelial cells (NOE)are compared with clinical ascites samples from several patients, 56634was found to be upregulated in the ascites samples compared to the NOE(Table 8). Clinical data comparing expression of 56634 in solid tumorvs. normal tissue counterpart (Table 9), and expression in Phase Inormal and diseased tissues (Table 10), all indicate that this gene isupregulated in tumor tissues compared to normal tissue counterparts.56634 is also expressed in several xenograft friendly cell lines (Table11). TABLE 5 TaqMan expression of 56634 in EGF Treated SKOV3 (OvarianCancer) Cells Tissue Type Expression SKOV-3 No EGF 4.6 SKOV-3 EGF ’155.7 SKOV-3 EGF ’30 7.1 SKOV-3 EGF ’60 5.3

[0089] TABLE 6 TaqMan expression of 56634 in EGF treated MCF10A cells(human breast cells) Tissue Type Expression MCF10A EGF 0 hr 110.0 MCF10AEGF 0.5 hr 115.4 MCF10A EGF 1 hr 170.2 MCF10A EGF 2 hr 97.1 MCF10A EGF 4hr 115.0 MCF10A EGF 8 hr 130.3

[0090] TABLE 7 Expression of 56634 in serum treated HEY (human ovariancancer) cells. Tissue Type Expression HEY 0 hr 5.0 HEY 1 hr 5.9 HEY 3 hr7.8 HEY 6 hr 6.1 HEY 9 hr 5.6 HEY 12 hr 5.4

[0091] TABLE 8 TaqMan expression of 56634 in Clinical Ascites samplesvs. NOE cells. Tissue Type Expression MDA 127 Normal Ovary 1.5 MDA 224Normal Ovary 0.5 MDA 124 Ovarian Ascites 1.8 MDA 126 Ovarian Ascites 5.1

[0092] TABLE 9 Oncology: Expression of 56634 in Normal (N), and Tumor(T), and metastatic (Met) Clinical Tissues Tissue Type Expression BreastN 7.6 Breast N 3.8 Breast N 2.6 Breast Tum: IDC-MD/PD 31.6 Breast T: IDC3.0 Breast Tum: IDC-PD 38.9 Breast T: IDC 1.5 Breast T ILC (LG) 10.5Lymph node (Breast met) 0.0 Lung (Breast met) 1.5 Ovary N 2.5 Ovary N1.9 Ovary T: PD-PS 6.4 Ovary T: MD-PS 2.7 Ovary T: PD-PS 13.0 Ovary T:PD-AC 2.1 Ovary T: MD/PD-PS 1.2 Lung N 0.7 Lung N 0.3 Lung N 3.1 LungT-SmC 27.1 Lung T: MD-SCC 22.6 Lung T: PD-NSCLC 1.6 Lung T: WD-AC 21.7Lung T: MD-AC 19.4 Lung T: MD-AC 6.8 Colon N 4.7 Colon N 1.3 Colon N 1.1Colon T: MD 22.4 Colon T: MD 44.0 Colon T 6.5 Colon T: MD-PD 34.2Colon-Liver Met 6.6 Colon-Liver Met 3.8 Liver N (female) 0.1 CervixSquamous CC 30.7 Cervix Squamous CC 2.0

[0093] TABLE 10 Phase I TaqMan expression of 56634 in Clinical TissuesTissue Type Expression Artery normal 13.5 Aorta diseased 0.0 Vein normal0.6 Coronary SMC 1.1 HUVEC 0.7 Hemangioma 0.0 Heart normal 1.6 Heart CHF1.6 Kidney 25.4 Skeletal Muscle 1.5 Adipose normal 0.0 Pancreas 0.0primary osteoblasts 1.7 Osteoclasts (diff) 0.1 Spinal cord normal 0.8Brain Cortex normal 208.0 Nerve 1.9 DRG (Dorsal Root Ganglion) 1.4Breast normal 1.8 Breast tumor 1.6 Ovary normal 0.0 Ovary Tumor 0.0Prostate Normal 5.4 Prostate Tumor 5.4 Salivary glands 1.8 Colon normal0.5 Colon Tumor 2.0 Lung normal 0.0 Lung tumor 20.7 Lung COPD 0.6 ColonIBD 0.8 Liver normal 0.0 Liver fibrosis 0.0 Spleen normal 0.0 Tonsilnormal 0.4 Lymph node normal 0.3 Small intestine normal 0.5 Macrophages0.0 Synovium 0.0 BM-MNC 0.0 Activated PBMC 0.1 Neutrophils 0.0Megakaryocytes 0.1 Erythroid 3.2 positive control 49.0 Skin normal 4.3Brain Hypothalamus normal 2.8

[0094] TABLE 11 TaqMan expression of 56634 in various xenofriendly celllines Tissue Type Expression MCF-7 Breast T 270.7 ZR75 Breast T 243.2T47D Breast T 327.6 MDA 231 Breast T 8.1 MDA 435 Breast T 8.4 SKBr3Breast 15.6 DLD 1 ColonT (stageC) 476.3 SW480 Colon T (stage B) 39.7HCT116 16.8 HT29 5.3 Colo 205 1.0 NCIH125 75.4 NCIH67 51.3 NCIH322 67.9NCIH460 12.5 A549 56.3 NHBE 114.2 SKOV-3 ovary 1.6 OVCAR-3 ovary 38.6293 Baby Kidney 87.5 293T Baby Kidney 120.7

[0095] In Situ Hubridization (ISH):

[0096] 56634 was found to be expressed by ISH in ovarian, breast andcolon tumor clinical samples. 56634 was localized to 0/3 normal ovarysamples, 6/12 ovarian tumors, 2/2 normal breast, 4/4 breast tumors, 0/1normal colon, 0/3 colon primary tumors, and 0/2 colon to livermetastases. See Table 12. TABLE 12 In Situ Hybridization expression of56634 in Clinical Human Tissues Spectrum Tissue Diagnosis Results Ovary:0/3 Normal; 6/12 Tumor CHT 2438 Ovary T Tumor (+/+) CHT 2433 Ovary TTumor (++/+) MDA 300 Ovary T Tumor (−/−) MDA 24 Ovary T Tumor (+/−) CLN346 Ovary T Tumor (−/−) CHT 2431 Ovary T Tumor (+/−) CHT 2430 Ovary TTumor (−/−) CHT 2432 Ovary T Tumor (+/+) CHT 2443 Ovary T Tumor (−/−)CHT 2429 Ovary T Tumor (++/+) MDA 222 Ovary T Tumor (−/−) CLN 356 OvaryT Tumor (−/−) CLN 572 Ovary N Normal ovarian stroma (−/−) CLN 571 OvaryN Normal ovarian stroma (−/−) CHT 619 Ovary N Normal ovarian stroma(−/−) Colon: 0/1 Normal; 0/3 Tumor; 0/2 Mets CHT 1877 Colon TAdenocarcinoma (−/−) CHT 1448 Colon T Adenocarcinoma (−/−) CHT 1855Colon T Adenocarcinoma (−/−) CHT 755 Colon M Metastatic tumor to theliver with (−/−) colonic origins CHT 866 Colon M Metastatic tumor to theliver with (−/−) colonic origins NDR 209 Colon N Normal colonicepithelium (−/−) Breast: 0/1 Normal; 2/4 Tumor CHT 1874 Breast T IDC(+/−) NDR 134 Breast T IDC (−/−) CHT 1837 Breast T ILC (−/−) CLN 662Breast T ILC (++/−) CHT 2248 Breast N Normal breast epithelial cells(−/−)

[0097] Human 55053 (EPK-55053)

[0098] The present invention is based, at least in part, on thediscovery of novel members of a family of molecules, referred to hereinas “Eukaryotic Protein Kinase-55053” or “EPK-55053” nucleic acid andpolypeptide molecules. Members of this family of molecules are able toparticipate in the modulation of the phosphorylation state of EPK-55053substrate molecules. By doing so, these molecules are able to contributeto the regulation and/or modulation of the activity of these substratemolecules, and, hence, the biochemical pathways with which thesubstrates are associated.

[0099] Protein kinases and phosphatases play critical roles in theregulation of biochemical and morphological changes associated withcellular growth and division (D'Urso, G. et al. (1990) Science250:786-791; Birchmeier, C. et al. (1993) Bioessays 15:185-189). Theyserve as growth factor receptors and signal transducers and have beenimplicated in cellular transformation and malignancy (Hunter, T. et al.(1992) Cell 70:375-387; Posada, J. et al. (1992) Mol. Biol. Cell3:583-592; Hunter, T. et al. (1994) Cell 79:573582). For example,protein kinases have been shown to participate in the transmission ofsignals from growth-factor receptors (Sturgill, T. W. et al. (1988)Nature 344:715-718; Gomez, N. et al. (1991) Nature 353:170-173), controlof entry of cells into mitosis (Nurse, P. (1990) Nature 344:503-508;Maller, J. L. (1991) Curr. Opin. Cell Biol. 3:269-275) and regulation ofactin bundling (Husain-Chishti, A. et al. (1988) Nature 334:718-721).

[0100] Protein kinases and phosphatases can be divided into differentgroups based on either amino acid sequence similarity or specificity foreither serine/threonine or tyrosine residues. A small number ofdual-specificity kinases and phosphatases have also been described.Within the broad classification, kinases and phosphatases can be furthersubdivided into families whose members share a higher degree ofcatalytic domain amino acid sequence identity and also have similarbiochemical properties. Most protein kinase and phosphatase familymembers also share structural features outside the kinase andphosphatase domain, respectively, that reflect their particular cellularroles. These include regulatory domains that control kinase orphosphatase activity or interaction with other proteins (Hanks, S. K. etal. (1988) Science 241:42-52).

[0101] In one embodiment, the EPK-55053 molecules of the presentinvention include at least one “transmembrane domain.” As used herein,the term “transmembrane domain” includes an amino acid sequence of about20-45 amino acid residues in length which spans the plasma membrane.More preferably, a transmembrane domain includes about at least 20, 25,30, 35, 40, or 45 amino acid residues and spans the plasma membrane.Transmembrane domains are rich in hydrophobic residues, and typicallyhave an alpha-helical structure. In a preferred embodiment, at least50%, 60%, 70%, 80%, 90%, 95% or more of the amino acids of atransmembrane domain are hydrophobic, e.g., leucines, isoleucines,alanines, valines, phenylalanines, prolines or methionines.Transmembrane domains are described in, for example, Zagotta W. N. etal. (1996) Annu. Rev. Neurosci. 19:235-263, the contents of which areincorporated herein by reference. Amino acid residues 214-231 of thehuman EPK-55053 polypeptide (SEQ ID NO:11) comprise a transmembranedomain.

[0102] To identify the presence of a transmembrane domain in anEPK-55053 protein, and make the determination that a protein of interesthas a particular profile, the amino acid sequence of the protein may besubjected to MEMSAT analysis. A MEMSAT analysis of the EPK-55053 proteinset forth as SEQ ID NO:11 results in the identification of atransmembrane domain in the amino acid sequence of human EPK-55053 (SEQID NO:11) at about residues 214-231 (having a score of 4.1). Two otherpotential transmembrane domains were also identified at about aminoacids 624-640 and 681-697 or SEQ ID NO:11.

[0103] In another embodiment, the EPK-55053 molecules of the presentinvention include at least one “eukaryotic protein kinase domain”. Asused herein, the term “eukaryotic protein kinase domain” includes aprotein domain having at least about 150-350 amino acid residues and abit score of at least 150 when compared against a eukaryotic proteinkinase domain Hidden Markov Model (HMM), e.g., PFAM Accession NumberPF00069. Preferably, a eukaryotic protein kinase domain includes aprotein having an amino acid sequence of about 190-320, 210-300, 250-260or more preferably about 252 amino acid residues, and a bit score of atleast 150, 210, 250, 290, or more preferably, 323.4. To identify thepresence of a eukaryotic protein kinase domain in an EPK-55053 protein,and make the determination that a protein of interest has a particularprofile, the amino acid sequence of the protein may be searched againsta database of known protein domains (e.g., the HMM database). Theeukaryotic protein kinase domain has been assigned the PFAM AccessionNo. PF00069 (see the PFAM website, available through the University ofWashington at St. Louis) and InterPro Accession No. IPR000719 (see thewebsite for the European Bioinformatics Institute). A search wasperformed against the HMM database resulting in the identification of aeukaryotic protein kinase domain in the amino acid sequence of humanEPK-55053 (SEQ ID NO:11) at about residues 34-285 of SEQ ID NO:11. Theidentified eukaryotic protein kinase domain is depicted as SEQ ID NO:14.

[0104] In another embodiment, the isolated nucleic acid molecules of thepresent invention encodes at least one “ubiquitin-associated domain” or“UBA domain”. As used interchangeably herein, the terms“ubiquitin-associated domain” and “UBA domain” include a protein domainhaving at least about 10-70 amino acid residues when compared against aUBA domain Hidden Markov Model (HMM), e.g., PFAM Accession NumberPF00627. Preferably, a UBA domain includes a protein having an aminoacid sequence of about 1070, 20-60, 30-50, 35-45 or more preferablyabout 40 amino acid residues, and a bit score of at least about 7.7. UBAdomains (described in, for example, Diekmann et al. (1998) Nat. Struct.Biol. 5:1042-1047) are domains that belong to an extensive family ofproteins which share a conserved sequence and which have associationswith ubiquitin and the ubiquitination pathway. To identify the presenceof a UBA domain in an EPK-55053 protein, and make the determination thata protein of interest has a particular profile, the amino acid sequenceof the protein may be searched against a database of known proteindomains (e.g., the HMM database). The UBA domain has been assigned thePFAM Accession No. PF00627 (see the PFAM website, available through theUniversity of Washington at St. Louis) and InterPro Accession No.IPR000449 (see the website for the European Bioinformatics Institute). Asearch was performed against the HMM database resulting in theidentification of a UBA domain in the amino acid sequence of humanEPK-55053 (SEQ ID NO:11) at about residues 315-356 of SEQ ID NO:11. Theidentified UBA domain is depicted in SEQ ID NO:15.

[0105] To elucidate the substrate specificity of the HPK-55053 proteinsof the present invention, further HMM analyses were performed using aproprietary database of Markov models, referred to herein as the SMARTHMM database. This analysis resulted in the identification of a serinethreonine kinase (“serkin_(—)6”) domain at about amino acids 34285 ofthe human EPK-55053 amino acid sequence set forth as SEQ ID NO:11.Notably, this serine/threonine kinase domain overlaps almost exclusivelywith the protein kinase domain identified by HMM searching of the PFAMdatabase, identifying the instant proteins as serine/threonine kinasesas compared to tyrosine kinases. This analysis also resulted in theidentification of a tyrosine kinase domain (“tyrkin_(—)6) at about aminoacid residues 34286 of SEQ ID NO:11. The identified serkin_(—)6 andtyrkin_(—)6 domains are depicted in SEQ ID NO:16 and 17, respectively.

[0106] Moreover, a signature sequence which is specific forserine/threonine kinases (consensus sequence given as SEQ ID NO:13) wasidentified at about residues 152-164 of SEQ ID NO:11. This signaturesequence occurs in the central part of the kinase catalytic domain ofserine/threonine kinases and contains a conserved aspartate residuewhich is important for the catalytic activity of the enzyme (Knighton D.R. et al. (1991) Science 253:407-414). The consensus signature sequencedescribed under the Prosite accession number PS00108 and is given as:

[0107] [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-[LIVMFYCT](3) (SEQ IDNO:13)

[0108] A description of the Pfam database can be found in Sonhammer etal. (1997) Proteins 28:405-420 and a detailed description of HMMs can befound, for example, in Gribskov et al. (1990) Methods Enzymol.183:146-159; Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531; andStultz et al. (1993) Protein Sci. 2:305-314, the contents of which areincorporated herein by reference.

[0109] In a preferred embodiment, the EPK-55053 molecules of theinvention include at least one transmembrane domain and/or at least oneeukaryotic protein kinase domain, and/or at least one UBA domain.

[0110] Isolated EPK-55053 polypeptides of the present invention, have anamino acid sequence sufficiently identical to the amino acid sequence ofSEQ ID NO:11 or are encoded by a nucleotide sequence sufficientlyidentical to SEQ ID NO:10 or 12. As used herein, the term “sufficientlyidentical” refers to a first amino acid or nucleotide sequence whichcontains a sufficient or minimum number of identical or equivalent(e.g., an amino acid residue which has a similar side chain) amino acidresidues or nucleotides to a second amino acid or nucleotide sequencesuch that the first and second amino acid or nucleotide sequences sharecommon structural domains or motifs and/or a common functional activity.For example, amino acid or nucleotide sequences which share commonstructural domains having at least 60%, 65%, 70%, 75%, 76%, 80%, 85%,90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%,99.4%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9% or more homology or identityacross the amino acid sequences of the domains and contain at least oneand preferably two structural domains or motifs, are defined herein assufficiently identical. Furthermore, amino acid or nucleotide sequenceswhich share at least 60%, 65%, 70%, 75%, 76%, 80%, 85%, 90%, 91%, 92%,93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%,99.6%, 99.7%, 99.8%, 99.9% or more homology or identity and share acommon functional activity are defined herein as sufficiently identical.

[0111] In a preferred embodiment, an EPK-55053 polypeptide includes atleast one or more of the following domains: a transmembrane domain, aeukaryotic protein kinase domain, a UBA domain, and has an amino acidsequence at least about 60%, 65%, 70%, 75%, 76%, 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%,99.5%, 99.6%, 99.7%, 99.8%, 99.9% or more homologous or identical to theamino acid sequence of SEQ ID NO:11, or the amino acid sequence encodedby the DNA insert of the plasmid deposited with ATCC as Accession Number______. In yet another preferred embodiment, an EPK-55053 polypeptideincludes at least one or more of the following domains: a transmembranedomain, a eukaryotic protein kinase domain, a UBA domain, and is encodedby a nucleic acid molecule having a nucleotide sequence which hybridizesunder stringent hybridization conditions to a complement of a nucleicacid molecule comprising the nucleotide sequence of SEQ ID NO:10 or SEQID NO:12. In another preferred embodiment, an EPK-55053 polypeptideincludes at least one or more of the following domains: a transmembranedomain, a eukaryotic protein kinase domain, a UBA domain, and has anEPK-55053 activity.

[0112] As used interchangeably herein, “EPK-55053 activity”, “biologicalactivity of EPK-55053” or “functional activity of EPK-55053”, includesan activity exerted by an EPK-55053 polypeptide or nucleic acid moleculeon an EPK-55053 responsive cell or tissue, or on an EPK-55053polypeptide substrate, as determined in vivo, or in vitro, according tostandard techniques. In one embodiment, an EPK-55053 activity is adirect activity, such as an association with an EPK-55053-targetmolecule. As used herein, a “target molecule” or “binding partner” is amolecule with which an EPK-55053 polypeptide binds or interacts innature, such that EPK-55053-mediated function is achieved. An EPK-55053target molecule can be a non-EPK-55053 molecule, for example, anon-EPK-55053 polypeptide. Additional, exemplary EPK-55053 targetmolecules can include lipid moieties, a lipid-associated moiety, or anucleic acid. In another embodiment, an EPK-55053 activity is anindirect activity, such as a cellular signaling activity mediated byinteraction of the EPK-55053 polypeptide with an EPK-55053 ligand.

[0113] In a preferred embodiment, an EPK-55053 polypeptide has one ormore of the following activities: (1) interaction with an EPK-55053substrate or target molecule (e.g., a non-EPK-55053 protein); (2)conversion of an EPK-55053 substrate or target molecule to a product(e.g., transfer of a phosphate group to a substrate or target molecule,or conversion of ATP to ADP); (3) interaction with and/or phosphatetransfer to a second non-EPK-55053 protein; (4) modulation of intra- orintercellular signaling and/or gene transcription (e.g., either directlyor indirectly); (5) modulation of the phosphorylation state of EPK-55053target molecules (e.g., a kinase or a phosphatase molecule) or thephosphorylation state of one or more proteins involved in cellulargrowth, metabolism, or differentiation, e.g., cardiac, epithelial, orneuronal cell growth or differentiation, as described in, for example,Lodish H. et al., Molecular Cell Biology (Scientific American BooksInc., New York, N.Y., 1995) and Stryer L., Biochemistry (W. H. Freeman,New York), the contents of which are incorporated herein by reference;(6) modulation of the activity of one or more proteins involved incellular growth or differentiation, e.g., cardiac, epithelial, orneuronal cell growth or differentiation; (7) modulation of expression ofone or more genes (e.g., a transcription factor); (8) modulation ofsignal transduction; and (9) participation in immunoregulation.

[0114] In other preferred embodiments, the EPK-55053 polypeptides of thepresent invention have one or more of the following activities: (1)modulation of cancer or tumor progression; (2) modulation of cellularproliferation; (3) modulation of tissue development (e.g.,embryogenesis); (4) modulation of differentiation; (5) modulation ofapoptosis; (6) modulation of energy metabolism; and (7) modulation of aubiquitination pathway. Thus, the EPK-55053 molecules of the presentinvention can participate in: (a) the regulation of transmission ofsignals from cellular receptors, e.g., growth factor receptors; (b) themodulation of the entry of cells into mitosis; (c) the modulation ofcellular differentiation; (d) the modulation of cell death; (e) theregulation of cytoskeleton function, e.g., actin bundling; and (f)metabolic pathways and the regulation of metabolic pathways.

[0115] The EPK-55053 molecules, by participating in the regulation ofphosphorylation states, provide novel diagnostic targets and therapeuticagents for controlling or treating a variety of kinase associateddisorders. As used herein, the term “kinase associated disorder” includedisorders, diseases, or conditions which are characterized by aberrant,e.g., upregulated, downregulated, or misregulated, protein kinaselevels. In a preferred embodiment, a kinase associated disorder includesthe inhibition or over-stimulation of the activity of kinases involvedin signaling pathways associated with cellular growth can lead toperturbed cellular growth, which can in turn lead to cellulargrowth-related disorders. As used herein, a “cellular growth-relateddisorder”, includes a disorder, disease, or condition characterized by aderegulation, e.g., an upregulation or a downregulation, of cellulargrowth. Cellular growth deregulation may be due to a deregulation ofcellular proliferation, cell cycle progression, cellular differentiationand/or cellular hypertrophy. Examples of cellular growth relateddisorders include cardiovascular disorders such as heart failure,hypertension, atrial fibrillation, dilated cardiomyopathy, idiopathiccardiomyopathy, or angina; proliferative disorders or differentiativedisorders such as cancer, e.g., melanoma, prostate cancer, cervicalcancer, breast cancer, colon cancer, or sarcoma.

[0116] Other examples of EPK-55053 associated disorders include CNSdisorders, cardiac-related disorders (cardiovascular disorders),disorders of the musculoskeletal system, hormonal disorders, immunedisorders, such as autoimmune disorders or immune deficiency disorders,e.g., congenital X-linked infantile hypogammaglobulinemia, transienthypogammaglobulinemia, common variable immunodeficiency, selective IgAdeficiency, chronic mucocutaneous candidiasis, or severe combinedimmunodeficiency.

[0117] EPK-55053 associated or related disorders also include disordersaffecting tissues in which EPK-55053 protein is expressed.

[0118] Isolation of the Human EPK-55053 cDNA

[0119] The invention is based, at least in part, on the discovery of ahuman gene encoding a novel 85.6 kD polypeptide, referred to herein ashuman EPK-55053. The entire sequence of the human clone 55053 wasdetermined and found to contain an open reading frame termed human“EPK-55053.” The nucleotide sequence of the human EPK-55053 genecontains 2980 nucleic acids and is set forth in the Sequence Listing asSEQ ID NO:10. The amino acid sequence of the human EPK-55053, containing778 amino acids, is set forth in the Sequence Listing as SEQ ID NO:11.The coding region (open reading frame) of SEQ ID NO:10 is set forth asSEQ ID NO:12. A plasmid containing the nucleotide sequence encodinghuman EPK-55053 was deposited with the American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Number ______. This deposit will be maintainedunder the terms of the Budapest Treaty on the International Recognitionof the Deposit of Microorganisms for the Purposes of Patent Procedure.This deposit was made merely as a convenience for those of skill in theart and is not an admission that a deposit is required under 35 U.S.C.§112.

[0120] Analysis of the Human EPK-55053 Molecules

[0121] A search using the polypeptide sequence of SEQ ID NO:11 wasperformed against the HMM database in PFAM resulting in theidentification of a eukaryotic protein kinase domain in the amino acidsequence of human EPK-55053 at about residues 34-285 of SEQ ID NO:11(score=323.4). Searching the SMART HMM database resulted in the furtheridentification of this domain as a serine threonine kinase domain. Theidentified eukaryotic protein kinase domain and serine threonine kinasedomain are depicted as SEQ ID NO:14, 16 and 17.

[0122] This search also resulted in the identification of a UBA domainin the amino acid sequence of human EPK-55053 at about residues 315-356of SEQ ID NO:11 (score=7.7). The identified UAB domain is depicted asSEQ ID NO:15.

[0123] A search using the polypeptide sequence of SEQ ID NO:11 was alsoperformed against the MEMSAT database, resulting in the identificationof potential transmembrane domains (score=4.1) in the amino acidsequence of human EPK-55053 (SEQ ID NO:11) at about residues 214-231,624-640, and 681-697.

[0124] Searches of the amino acid sequence of human EPK-55053 werefurther performed against the Prosite database. These searches resultedin the identification in the amino acid sequence of human EPK-55053 of apotential cAMP/cGMP-dependant protein kinase phosphorylation site(ProSite Accession No. PS00004) at about residues 272-275 of SEQ IDNO:11. A glycosaminoglycan attachment site (ProSite Accession No.PS00002) was also identified at about residues 682-685 of SEQ ID NO:11.Fifteen potential protein kinase C phosphorylation sites (ProSiteAccession No. PS00005) were identified at about residues 129-131,417-419, 427-429, 447-449, 472-474, 496-498, 508-510, 523-525, 555-557,563-565, 619-621, 643-645, 676-678, 699-701, and 758-760 of SEQ IDNO:11. Twelve potential casein kinase II sites (ProSite Accession No.PS00006) were identified at about residues 114-117, 129-132, 142-145,185-188, 311-314, 341-344, 363-366, 404-407, 575-578, 586-589, 668-671,and 715-718 of SEQ ID NO:1. Eleven potential N-myristoylation sites(ProSite Accession No. PS00008) were identified at about residues 4-9,10-15, 57-62, 435-440, 468-473, 485-490, 507-512, 530-535, 541-546,597-602, and 681-686 of SEQ ID NO:11. Three amidation sites (ProSiteAccession No. PS00009) were identified at about residues 208-211,300-303, and 390-393 of SEQ ID NO:11. Most notably, a serine/threonineprotein kinase active site signature (ProSite Accession No. PS00108) wasidentified at about residues 152-164 of SEQ ID NO:11.

[0125] The amino acid sequence of human EPK-55053 was analyzed using theprogram PSORT (available online; see Nakai, K. and Kanehisa, M. (1992)Genomics 14:897-911) to predict the localization of the proteins withinthe cell. This program assesses the presence of different targeting andlocalization amino acid sequences within the query sequence. The resultsof the analyses show that human EPK-55053 may be localized to thecytoplasm, nucleus, or mitochondria.

[0126] Further homologies of interest were identified by using the aminoacid sequence of EPK-55053 (SEQ ID NO:11) to search the ProDom database(available through the Institute National de la Recherche Agronomique,France). This search resulted in the identification of homology in theamino acid sequence of human EPK-55053 to a yeast probableserine/threonine protein kinase, a hypothetical 169.2 kD protein, atransmembrane kinase protein, a putative NPK-1 kinase, a C. elegansserine/threonine protein kinase, and HRPOPK-1 protein.

[0127] Human 2504, 15977 and 14760

[0128] The present invention is based, in part, on the discovery ofnovel protein kinase family members, referred to herein as “2504, 15977,and 14760”. The nucleotide sequence of a cDNA encoding 2504 is shown inSEQ ID NO:18, and the amino acid sequence of a 2504 polypeptide is shownin SEQ ID NO:19. In addition, the nucleotide sequence of the 2504 codingregion is depicted in SEQ ID NO:20. The nucleotide sequence of a cDNAencoding 15977 is shown in SEQ ID NO:21, and the amino acid sequence ofa 15977 polypeptide is shown in SEQ ID NO:22. In addition, thenucleotide sequence of the 15977 coding region is depicted in SEQ IDNO:23. The nucleotide sequence of a cDNA encoding 14760 is shown in SEQID NO:24, and the amino acid sequence of a 14760 polypeptide is shown inSEQ ID NO:25. In addition, the nucleotide sequence of the 14760 codingregion is depicted in SEQ ID NO:26.

[0129] Human 2504

[0130] The human 2504 sequence (SEQ ID NO:18), which is approximately2297 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1503 nucleotides(nucleotides 154-1656 of SEQ ID NO:18; SEQ ID NO:20). The codingsequence encodes a 501 amino acid protein (SEQ ID NO:19).

[0131] This mature protein form is approximately 501 amino acid residuesin length (from about amino acid 1 to amino acid 501 of SEQ ID NO:19).Human 2504 contains the following regions or other structural features:a eukaryotic protein kinase domain (PFAM Accession PF00069) located atabout amino acid residues 37 to 286 of SEQ ID NO:19; and aserine/threonine kinase domain located at about amino acid residues 24to 286 of SEQ ID NO:19.

[0132] The 2504 protein also includes the following domains: twelvepredicted Protein Kinase C phosphorylation sites (PS00005) located atabout amino acids 21 to 23, 46-48, 51-53, 91-93, 103-105, 118-120,138-140, 292-294, 422-424, 482-484, and 495-497 of SEQ ID NO:19; tenpredicted Casein Kinase II phosphorylation sites (PS00006) located atabout amino 7-10, 91-94, 103-106, 118-121, 276-279, 341-344, 364-367,470-473, 483-486, and 495-498 of SEQ ID NO:19; two predicted tyrosinekinase phosphorylation sites (PS00007) located at about amino acids127-135 and 484-491 of SEQ ID NO:19; two predicted N-myristoylationsites (PS00008) located at about amino acids 288-293 and 349-354 of SEQID NO:19; and one predicted amidation site located at about amino acids59-62 of SEQ ID NO:19.

[0133] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0134] A plasmid containing the nucleotide sequence encoding human 2504(clone Fbh2504FL) was deposited with American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Number ______. This deposit will be maintainedunder the terms of the Budapest Treaty on the International Recognitionof the Deposit of Microorganisms for the Purposes of Patent Procedure.This deposit was made merely as a convenience for those of skill in theart and is not an admission that a deposit is required under 35 U.S.C.§112.

[0135] Human 15977

[0136] The human 15977 sequence (SEQ ID NO:21), which is approximately4417 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1377 nucleotides(nucleotides 337-1713 of SEQ ID NO:21; SEQ ID NO:23). The codingsequence encodes a 459 amino acid protein (SEQ ID NO:22).

[0137] This mature protein form is approximately 459 amino acid residuesin length (from about amino acid 1 to amino acid 459 of SEQ ID NO:22).Human 15977 contains the following regions or other structural features:a eukaryotic protein kinase domain (PFAM Accession PF00069) located atabout amino acid residues 44 to 276 of SEQ ID NO:22; and aserine/threonine kinase domain located at about amino acid residues 44to 329 of SEQ ID NO:22.

[0138] The 15977 protein also includes the following domains: twopredicted N-glycosylation sites (PS00001) located at about amino acids370-373 and 388-391 of SEQ ID NO:22; two cAMP- and cGMP-dependentprotein kinase phosphorylation sites (PS00004) located at about aminoacids 270-273 and 451-454 of SEQ ID NO:22; nine predicted Protein KinaseC phosphorylation sites (PS00005) located at about amino acids 14-16,137-139, 199-201, 214-216, 229-231, 258-260, 269-271, 355-357, and373-375 of SEQ ID NO:22; eight predicted Casein Kinase II sites(PS00006) located at about amino 96-99, 124-127, 150-153, 229-232,258-261, 273-276, 355-358, and 411-414 of SEQ ID NO:22; two predictedN-myristoylation sites (PS00008) located at about amino 30-35 and422-427 of SEQ ID NO:22; one predicted amidation site (PS00009) locatedat about amino acids 46-49 of SEQ ID NO:22; and a Serine/Threonineprotein kinase active-site signature (PS 00108) located at about aminoacids 160-172 of SEQ ID NO:22.

[0139] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0140] A plasmid containing the nucleotide sequence encoding human 15977(clone Fbh15977FL) was deposited with American Type-Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Number ______. This deposit will be maintainedunder the terms of the Budapest Treaty on the International Recognitionof the Deposit of Microorganisms for the Purposes of Patent Procedure.This deposit was made merely as a convenience for those of skill in theart and is not an admission that a deposit is required under 35 U.S.C.§112.

[0141] Human 14760

[0142] The human 14760 sequence (SEQ ID NO:24), which is approximately2046 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1788 nucleotides(nucleotides 119-1906 of SEQ ID NO:24; SEQ ID NO:26). The codingsequence encodes a 596 amino acid protein (SEQ ID NO:25).

[0143] This mature protein form is approximately 596 amino acid residuesin length (from about amino acid 1 to amino acid 596 of SEQ ID NO:25).Human 14760 contains the following regions or other structural features:a eukaryotic protein kinase domain (PFAM Accession PF00069) located atabout amino acid residues 285 to 540 of SEQ ID NO:25; and aserine/threonine kinase domain located at about amino acid residues 285to 540 of SEQ ID NO:25.

[0144] The 14760 protein also includes the following domains: twopredicted N-glycosylation sites (PS00001) located at about amino acids278-281 and 416-419 of SEQ ID NO:25; three cAMP- and cGMP-dependentprotein kinase phosphorylation sites (PS00004) located at about aminoacids 140-143, 317-320, and 583-586 SEQ ID NO:25; eleven predictedProtein Kinase C phosphorylation sites (PS00005) located at about aminoacids 17-19, 49-51, 59-61, 107-109, 159-161, 203-205, 224-226, 235-237,247-249, 320-322, and 460-462 of SEQ ID NO:25; eight predicted CaseinKinase II phosphorylation sites (PS00006) located at about amino157-160, 184-187, 203-206, 247-250, 301-304, 320-323, 351-354, and379-382 of SEQ ID NO:25; one predicted tyrosine kinase phosphorylationsites (PS00007) located at about amino acids 370-376 of SEQ ID NO:25;nine predicted N-myristoylation sites (PS00008) located at about aminoacids 83-88, 116-121, 135-140, 178-183, 241-246, 277-282, 293-298,308-313, and 589-594 of SEQ ID NO:25; one predicted amidation site(PS00009) located at about amino acids 128-131 of SEQ ID NO:25; aprotein kinases ATP-binding region signature located at about aminoacids 291-299 of SEQ ID NO:25; and a Serine/Threonine protein kinaseactive-site signature (PS 00108) located at about amino acids 402-414 ofSEQ ID NO:25.

[0145] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0146] A plasmid containing the nucleotide sequence encoding human 14760(clone Fbh14760FL) was deposited with American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Number ______. This deposit will be maintainedunder the terms of the Budapest Treaty on the International Recognitionof the Deposit of Microorganisms for the Purposes of Patent Procedure.This deposit was made merely as a convenience for those of skill in theart and is not an admission that a deposit is required under 35 U.S.C.§112. TABLE 13 Summary of Domains of 2504, 15977, and 14760 ProteinProtein Kinase Domain Serine/Threonine Kinase Domain 2504 About aminoacids 37-286 About amino acids 24-286 of SEQ ID NO: 19 of SEQ ID NO: 1915977 About amino acids 44-276 About amino acids 44-329 of SEQ ID NO: 22of SEQ ID NO: 22 14760 About amino acids 285-540 About amino acids285-540 of SEQ ID NO: 25 of SEQ ID NO: 25

[0147] The 2504, 15977, and 14760 proteins contains a significant numberof structural characteristics in common with members of the proteinkinase family. The term “family” when referring to the protein andnucleic acid molecules of the invention means two or more proteins ornucleic acid molecules having a common structural domain or motif andhaving sufficient amino acid or nucleotide sequence homology as definedherein. Such family members can be naturally or non-naturally occurringand can be from either the same or different species. For example, afamily can contain a first protein of human origin as well as otherdistinct proteins of human origin, or alternatively, can containhomologues of non-human origin, e.g., rat or mouse proteins. Members ofa family can also have common functional characteristics.

[0148] A 2504, 15977, or 14760 polypeptide can include a “protein kinasedomain” or regions homologous with a “protein kinase domain”.

[0149] As used herein, the term “protein kinase” includes a protein orpolypeptide which is capable of modulating its own phosphorylation stateor the phosphorylation state of another protein or polypeptide. Proteinkinases play critical roles in the regulation of biochemical andmorphological changes associated with cellular growth and division(D'Urso, G. et al. (1990) Science 250: 786-791; Birchmeier. C. et al.(1993) Bioessays 15: 185-189). They serve as growth factor receptors andsignal transducers and have been implicated in cellular transformationand malignancy (Hunter, T. et al. (1992) Cell 70: 375-387; Posada, J. etal. (1992) Mol. Biol. Cell 3: 583-592; Hunter, T. et al. (1994) Cell 79:573-582). For example, protein kinases have been shown to participate inthe transmission of signals from growth-factor receptors (Sturgill, T.W. et al. (1988) Nature 344: 715-718; Gomez, N. et al. (1991) Nature353: 170-173), control of entry of cells into mitosis (Nurse, P. (1990)Nature 344: 503-508; Maller, J. L. (1991) Curr. Opin. Cell Biol. 3:269-275) and regulation of actin bundling (Husain-Chishti, A. et al.(1988) Nature 334: 718-721).

[0150] Protein kinases can have a specificity for (i.e., a specificityto phosphorylate) serine/threonine residues, tyrosine residues, or bothserine/threonine and tyrosine residues, e.g., the dual specificitykinases. As referred to herein, protein kinases preferably include acatalytic domain of about 200-400 amino acid residues in length,preferably about 200-300 amino acid residues in length, or morepreferably about 250-300 amino acid residues in length. Specificity of aprotein kinase for phosphorylation of either tyrosine orserine/threonine can be predicted by the sequence of two of thesubdomains (VIb and VIII) in which different residues are conserved ineach class (as described in, for example, Hanks et al. (1988) Science241:42-52) the contents of which are incorporated herein by reference).These subdomains are also described in further detail herein.

[0151] Protein kinases play a role in signaling pathways associated withcellular growth. For example, protein kinases are involved in theregulation of signal transmission from cellular receptors, e.g.,growth-factor receptors; entry of cells into mitosis; and the regulationof cytoskeleton function, e.g., actin bundling. Thus, the molecules ofthe present invention may be involved in: 1) the regulation oftransmission of signals from cellular receptors, e.g., cell growthfactor receptors; 2) the modulation of the entry of cells, e.g.,precursor cells, into mitosis; 3) the modulation of cellulardifferentiation; 4) the modulation of cell death; and 5) the regulationof cytoskeleton function, e.g., actin bundling.

[0152] Inhibition or over stimulation of the activity of protein kinasesinvolved in signaling pathways associated with cellular growth can leadto perturbed cellular growth, which can in turn lead to cellular growthrelated disorders. As used herein, a “cellular growth related disorder”includes a disorder, disease, or condition characterized by aderegulation, e.g., an upregulation or a downregulation, of cellulargrowth. Cellular growth deregulation may be due to a deregulation ofcellular proliferation, cell cycle progression, cellular differentiationand/or cellular hypertrophy. Examples of cellular growth relateddisorders include cardiovascular disorders such as heart failure,hypertension, atrial fibrillation, dilated cardiomyopathy, idiopathiccardiomyopathy, or angina; proliferative disorders or differentiativedisorders such as cancer, e.g., melanoma, prostate cancer, cervicalcancer, breast cancer, colon cancer, or sarcoma.

[0153] As used herein, the term “protein kinase domain” includes anamino acid sequence of about 150 to 400 amino acid residues in lengthand having a bit score for the alignment of the sequence to the proteinkinase domain (HMM) of at least 50. Preferably, a protein kinase domainincludes at least about 200-400 amino acids, more preferably about200-300 amino acid residues, or about 220-270 amino acids and has a bitscore for the alignment of the sequence to the protein kinase domain(HMM) of at least 120 or greater. The protein kinase domain (HMM) hasbeen assigned the PFAM Accession PF00069. An alignment of the proteinkinase domain (amino acids 37 to 286 of SEQ ID NO:19) of human 2504 witha consensus amino acid sequence derived from a hidden Markov modelyields a score of 229.1 (E=6.5e−65). The identified protein kinasedomain of 2504 is depicted in SEQ ID NO:27. An alignment of the proteinkinase domain (amino acids 44 to 276 of SEQ ID NO:22) of human 15977with a consensus amino acid sequence derived from a hidden Markov modelyields a score of 123.3 (E=4.3e−33). The identified protein kinasedomain of 15977 is depicted in SEQ ID NO:29. An alignment of the proteinkinase domain (amino acids 285 to 540 of SEQ ID NO:25) of human 14760with a consensus amino acid sequence derived from a hidden Markov modelyields a score of 251.1 (E=1.5e−71). The identified protein kinasedomain of 2504 is depicted in SEQ ID NO:30.

[0154] In a preferred embodiment 2504, 15977, or 14760 polypeptide orprotein has a “protein kinase domain” or a region which includes atleast about 200-400 more preferably about 200-300 or 220-270 amino acidresidues and has at least about 70% 80% 90% 95%, 99%, or 100% homologywith a “protein kinase domain,” e.g., the protein kinase domain of human2504, 15977, or 14760 (e.g., residues 37-286 of SEQ ID NO:19; residues44-276 of SEQ ID NO:22, or residues 285-540 of SEQ ID NO:25).

[0155] A 2504, 15977, or 14760 molecule can further include a“serine/threonine kinase domain.”

[0156] As used herein, the term “serine/threonine kinase domain”includes an amino acid sequence of about 150 to 400 amino acid residuesin length and having a bit score for the alignment of the sequence tothe protein kinase domain (HMM) of at least 15. Preferably, aserine/threonine kinase domain includes at least about 200-400 aminoacids, more preferably about 200-300 amino acid residues, or about220-270 amino acids and has a bit score for the alignment of thesequence to the serine/threonine kinase domain (HMM) of at least 50 orgreater. An alignment of the serine/threonine kinase domain (amino acids24 to 286 of SEQ ID NO:19) of human 2504 with a consensus amino acidsequence derived from a hidden Markov model yields a score of 284.1(E=1.8e−81). An alignment of the serine/threonine kinase domain (aminoacids 44 to 329 of SEQ ID NO:22) of human 15977 with a consensus aminoacid sequence derived from a hidden Markov model yields a score of 64.9(E=1.8e−15). An alignment of the serine/threonine kinase domain (aminoacids 285 to 540 of SEQ ID NO:25) of human 14760 with a consensus aminoacid sequence derived from a hidden Markov model yields a score of 296.2(E=4e−85). The identified serine/threonine kinase domains in 2504, 15977and 14760 is depicted in SEQ ID NO:28.

[0157] In a preferred embodiment 2504, 15977, or 14760 polypeptide orprotein has a “serine/threonine kinase domain” or a region whichincludes at least about 200-400 more preferably about 200-300 or 220-270amino acid residues and has at least about 70% 80% 90% 95%, 99%, or 100%homology with a “serine/threonine kinase domain,” e.g., theserine/threonine kinase domain of human 2504, 15977, or 14760 (e.g.,residues 24-286 of SEQ ID NO:19; residues 44-329 of SEQ ID NO:22, orresidues 285-540 of SEQ ID NO:25).

[0158] To identify the presence of a “protein kinase” domain or a“serine/threonine kinase” domain in a 2504, 15977, or 14760 proteinsequence, and make the determination that a polypeptide or protein ofinterest has a particular profile, the amino acid sequence of theprotein can be searched against a database of HMMs (e.g., the Pfamdatabase, release 2.1) using the default parameters. For example, thehmmsf program, which is available as part of the HMMER package of searchprograms, is a family specific default program for MILPAT0063 and ascore of 15 is the default threshold score for determining a hit.Alternatively, the threshold score for determining a hit can be lowered(e.g., to 8 bits). A description of the Pfam database can be found inSonhammer et al. (1997) Proteins 28(3):405-420 and a detaileddescription of HMMs can be found, for example, in Gribskov et al. (1990)Meth. Enzymol. 183:146-159; Gribskov et al. (1987) Proc. Natl. Acad.Sci. USA 84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531;and Stultz et al. (1993) Protein Sci. 2:305-314, the contents of whichare incorporated herein by reference.

[0159] A 2504, 15977, or 14760 family member can include a proteinkinase domain, e.g. a serine/threonine kinase domain.

[0160] As the 2504, 15977, or 14760 polypeptides of the invention maymodulate 2504, 15977, or 14760-mediated activities, they may be usefulas of for developing novel diagnostic and therapeutic agents for 2504,15977, or 14760-mediated or related disorders, as described below.

[0161] As used herein, a “2504, 15977, or 14760 activity”, “biologicalactivity of 2504, 15977, or 14760” or “functional activity of 2504,15977, or 14760”, refers to an activity exerted by a 2504, 15977, or14760 protein, polypeptide or nucleic acid molecule on e.g., a 2504,15977, or 14760-responsive cell or on a 2504, 15977, or 14760 substrate,e.g., a protein substrate, as determined in vivo or in vitro. In oneembodiment, a 2504, 15977, or 14760 activity is a direct activity, suchas an association with a 2504, 15977, or 14760 target molecule. A“target molecule” or “binding partner” is a molecule with which a 2504,15977, or 14760 protein binds or interacts in nature, e.g., a proteincontaining one or more serine and or threonine residues. A 2504, 15977,or 14760 activity can also be an indirect activity, e.g., a cellularsignaling activity mediated by interaction of the 2504, 15977, or 14760protein with a 2504, 15977, or 14760 receptor. For example, the 2504,15977, or 14760 proteins of the present invention can have one or moreof the following activities: 1) the regulation of transmission ofsignals from cellular receptors, e.g., cell growth factor receptors; 2)the modulation of the entry of cells, e.g., precursor cells, intomitosis; 3) the modulation of cellular differentiation; 4) themodulation of cell death; 5) the regulation of cytoskeleton function,e.g., actin bundling; or 6) the ability to phosphorylate a substrate.

[0162] Based on the above-described sequence similarities, the 2504,15977, and 14760 molecules of the present invention are predicted tohave similar biological activities as protein kinase family members.Thus, the 2504, 15977, and 14760 molecules can act as novel diagnostictargets and therapeutic agents for controlling one or more of cellularproliferative and/or differentiative disorders, disorders associatedwith bone metabolism, immune disorders, hematopoietic disorders,cardiovascular disorders, liver disorders, viral diseases, pain ormetabolic disorders.

[0163] In addition, the 2504, 15977, and 14760 molecules of theinvention may modulate physiological and pathological processes in thecells or tissues where they are expressed. For example, Taq Man studiesdescribed herein show abundant expression of 2504, 15977, and 14760mRNAs in neural tissues, including the brain cortex and hypothalamus.15977 mRNA is also highly expressed in epithelial cells, astrocytes(glial cells), HUVEC cells, smooth muscle cells and fetal liver. 14760mRNA is also abundantly expressed in the fetal liver, endothelial cells,fetal heart, fibroblasts, bone marrow glycophorin-positive cells,hepatocytes, cardiovascular cells, and skeletal muscle. Accordingly,these molecules can act as novel diagnostic targets and therapeuticagents of disorders involving the cells or tissues where they areexpressed, e.g., neural (e.g., brain or astrocytic) disorders;cardiovascular and blood vessel (smooth muscle or endothelial cell)disorders; immune disorders (e.g., disorders involvingglycophorin-positive cells); hepatic or liver disorders; skin disorders;skeletal disorders, among others.

[0164] Identification and Characterization of Human 2504, 15977, or14760 cDNA and Genomic Sequence

[0165] The human 2504 sequence (SEQ ID NO:18), which is approximately2297 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1503 nucleotides(nucleotides 154-1656 of SEQ ID NO:18; SEQ ID NO:20). The codingsequence encodes a 501 amino acid protein (SEQ ID NO:19).

[0166] The human 15977 sequence (SEQ ID NO:21), which is approximately4417 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1377 nucleotides(nucleotides 337-1713 of SEQ ID NO:21; SEQ ID NO:23). The codingsequence encodes a 459 amino acid protein (SEQ ID NO:22).

[0167] The human 14760 sequence (SEQ ID NO:24), which is approximately2046 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 1788 nucleotides(nucleotides 119-1906 of SEQ ID NO:24; SEQ ID NO:26). The codingsequence encodes a 596 amino acid protein (SEQ ID NO:25).

[0168] Tissue Distribution of 2504, 15977, or 14760 mRNA

[0169] Endogenous human 2504, 15977, and 14760 gene expression wasdetermined using the Perkin-Elmer/ABI 7700 Sequence Detection Systemwhich employs TaqMan technology.

[0170] To determine the level of 2504, 15977, and 14760 in various humantissues a primer/probe set was designed using Primer Express(Perkin-Elmer) software and primary cDNA sequence information. Total RNAwas prepared from a series of human tissues using an RNeasy kit fromQiagen. First strand cDNA was prepared from 1 μg total RNA using anoligo-dT primer and Superscript II reverse transcriptase (Gibco/BRL).cDNA obtained from approximately 50 ng total RNA was used per TaqManreaction. 2504, 15977, and 14760 mRNA levels were analyzed in a varietyof samples of human tissues

[0171] Relative 2504 mRNA expression was determined by TaqMan assays onmRNA derived from the following tissues: monkey cortex; monkey dorsalroot ganglion; monkey spinal cord; monkey sciatic nerve; monkey kidney;monkey hairy skin; monkey heart left ventricle; monkey gastro muscle;monkey liver; human brain; human spinal cord; human heart; human kidney;human liver; and human lung. The highest 2504 mRNA expression wasobserved in monkey cortex, human brain, and monkey and human spinalcords.

[0172] Relative 15977 mRNA expression was determined by TaqMan assays onmRNA derived from the following human tissues: (1) Aorta/normal; (2)Fetal heart/normal; (3) Heart normal; (4) Heart/congestive heart failure(CHF); (5) Vein/Normal; (6) Smooth muscle cells (SMC) (Aortic); (7)Spinal cord/Normal; (8) Brain cortex/Normal; (9) Brainhypothalamus/Normal; (10) Glial cells (Astrocytes); (11)Brain/Glioblastoma; (12) Breast/Normal; (13) Breast tumor/(invasivecarcinoma (IDC); (14) Ovary/Normal; (15) Ovary/Tumor; (16) Pancreas;(17) Prostate/Normal; (18) Prostate/Tumor; (19) Colon/normal; (20)Colon/tumor; (21) Colon/IBD; (22) Kidney/normal; (23) Liver/normal; (24)Liver fibrosis; (25) Fetal Liver/normal; (26) Lung/normal; (27)Lung/tumor; (28) Lung/COPD; (29) Spleen/normal; (30) Tonsil/normal; (31)Lymph node/normal; (32) Thymus/normal; (33) Epithelial Cells (prostate);(34) Endothelial Cells (aortic); (35) Skeletal Muscle/Normal; (36)Fibroblasts (Dermal); (37) Skin/normal; (38) Adipose/Normal; (39)Osteoblasts (primary); (40) Osteoblasts (undifferentiated); (41)Osteoblasts (Diff); (42) Osteoclasts; (43) Aortic smooth muscle cells(SMC) Early; (44) Aortic SMC Late; (45) Shear human umbilical veinendothelial cells (HUVEC); and (46) Static HUVEC. Elevated 15977 mRNAexpression was observed in epithelial cells, astrocytes (glial cells),normal brain (e.g., cortex and hypothalamus), HUVEC, and normal fetalliver.

[0173] Relative 14760 mRNA expression was determined by TaqMan assays onmRNA derived from the following human tissues: (1) Aorta/Normal; (2)Fetal Heart/Normal; (3) Heart/Normal; (4) Heart/CHF; (5) Vein/Normal;(6) SMC/aortic; (7) Nerve; (8) Spinal Cord/Normal; (9) BrainCortex/Normal; (10) Brain hypothalamus; (11) Glial Cells (astrocytes);(12) Glioblastoma; (13) Breast/Normal; (14) Breast/IDC; (15)Ovary/Normal; (16) Ovary/Tumor; (17) Pancreas; (18) Prostate/Normal;(19) Prostate/tumor adenocarcinoma; (20) Colon/Normal; (21) Colon/Tumor;(22) Colon/IBD; (23) Kidney/Normal; (24) Liver/Normal; (25)Liver/Fibrosis; (26) Fetal Liver/Normal; (27) Lung/Normal; (28) COPD;(29) Spleen/Normal; (30) Tonsil/Normal; (31) Lymph Node/Normal; (32)Thymus/Normal; (33) Epithelial Cells; (34) Endothelial cells; (35)Skeletal Muscle/Normal; (36) Fibroblasts; (37) Skin/Normal; (38)Adipose/normal; (39) Osteoblast/Primary; (40)Osteoblast/undifferentiated; (41) Osteoblast/differentiated; and (42)Osteoclasts. Elevated 14760 mRNA expression was observed in normal brain(e.g., cortex and hypothalamus), and normal fetal liver and fetal heart.

[0174] Relative 14760 mRNA expression was determined by TaqMan assays onmRNA derived from the following tissues and cell lines: (1) Heart; (2)Lung; (3) Kidney; (4) Fetal Liver; (5) Spleen; (6) Granulocytes; (7)NHDF mock; (8) NHLF mock; (9) NHLF TGF; (10) HepG2 Mock; (11) HepG2 TGF;(12) Pass Stell; (13) Liver Pool; (14) Control liver; (15) LF/NDR 191;(16) LF/NDR 193; (17) LF/NDR 079; (18) LN NDR 173; (19) Tonsil; (20) TH124 hr. MP39; (21) TH2 24 hr. MP39; (22) TH1 24 hr. MP21; (23) TH2 24 hr.MP21; (24) CD4; (25) CD8; (26) CD19; (27) CD3 MP42 rest; (28) CD14; (29)PBMC MOCK; (30) Bone marrow mononuclear cells (BM MNC); (31)CD34-positive cells (MPB CD34+); (32) Bone marrow glycophorin-positivecells (BM GPA+); (33) Cord Blood; (34) Erythroid; (35) Megakaryocytes;(36) Neutrophils (Neut) after 14 days in culture (d14); (37)CD14−/CD15+; (38) MBM CD11b; (39) HepG2; (40) HepG2.2.15; (41) MAI 01;(42) HL60; (43) K562; (44) Molt 4; (45) Hep3B Normoxia; and (46) Hep3BHypoxia. Elevated 14760 mRNA expression was observed in pass stell, bonemarrow glycophorin-positive cell lines, MOLT-4 cell lines and fetalliver.

[0175] Relative 14760 mRNA expression was determined using acardiovascular organ panel by TaqMan assays on mRNA derived from thefollowing cardiovascular tissues: normal atria; normal left ventricle;diseased right ventricle; diseased left ventricle; kidney; liver; andskeletal muscle. Elevated 14760 mRNA expression was observed in skeletalmuscle and cardiovascular tissues.

[0176] Human 25501

[0177] The invention is based, at least in part, on the discovery of anovel transferase referred to herein as “25501”. The human 25501sequence (SEQ ID NO:31), which is approximately 1971 nucleotides longincluding untranslated regions, contains a predictedmethionine-initiated coding sequence of about 1512 nucleotides,including the termination codon (nucleotides indicated as coding of SEQID NO:31; SEQ ID NO:33). The coding sequence encodes a 503 amino acidprotein (SEQ ID NO:32).

[0178] Human 25501 contains the following regions or other structuralfeatures (for general information regarding PFAM identifiers, PS prefixand PF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420: a transfer domain (ProDom No. PD034341, SEQID NO:34) located at about amino acid residues 280 to 411 of SEQ IDNO:32; a recognition/binding domain located at about amino acid residues30 to 250 of SEQ ID NO:32; six protein kinase C phosphorylation sites(Prosite PS00005) located at about amino acids 47 to 49, 126 to 128, 178to 180, 181 to 183, 206 to 208, and 210 to 212 of SEQ ID NO:32; tencasein kinase II phosphorylation sites (Prosite PS00006) located atabout amino acids 10 to 13, 41 to 44, 54 to 57, 126 to 129, 179 to 182,222 to 225, 292 to 295, 357 to 360, 431 to 434, and 456 to 459 of SEQ IDNO:32; one cAMP/cGMP-dependent protein kinase phosphorylation site(Prosite PS00004) located at about amino acids 414 to 417 of SEQ IDNO:32; one tyrosine kinase phosphorylation site (Prosite PS00007)located at about amino acids 318 to 325 of SEQ ID NO:32; one amidationsite (Prosite PS00009) located at about amino acids 377 to 380 of SEQ IDNO:32; and six N-myristoylation sites (Prosite PS00008) located at aboutamino acids 103 to 108, 281 to 286, 327 to 332, 337 to 342, 437 to 442,and 449 to 454 of SEQ ID NO:32.

[0179] A plasmid containing the nucleotide sequence encoding human25501, named Fbh25501FL, was deposited with American Type CultureCollection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209,on ______ and assigned Accession Number ______. This deposit will bemaintained under the terms of the Budapest Treaty on the InternationalRecognition of the Deposit of Microorganisms for the Purposes of PatentProcedure. This deposit was made merely as a convenience for those ofskill in the art and is not an admission that a deposit is requiredunder 35 U.S.C. §112.

[0180] The 25501 protein contains a significant number of structuralcharacteristics in common with members of the transferase family, inparticular, of methyltransferases. In general, transferases catalyze thetransfer of one molecular group from a donor molecule to an acceptormolecule. Examples of such molecular groups include phosphate, amino,methyl, acetyl, acyl, phosphatidyl, phosphoribosyl, among other groups.The methyltransferase family is a large superfamily of enzymes thatregulate biological processes by catalyzing the transfer of methylgroups to a wide variety of endogenous and exogenous compounds,including DNA, RNA, proteins, hormones, neurotransmitters, drugs, andxenobiotics (Weinshilboum et al. (1999) Annu. Rev. Pharmacol. Toxicol.39:19-52).

[0181] Methylation of DNA can play an important role in the control ofgene expression in mammalian cells. DNA methyltransferases are involvedin DNA methylation and catalyze the transfer of a methyl group fromS-adenosylmethionine to cytosine residues to form 5-methylcytosine, amodified base that is found mostly at CpG sites in the genome. Thepresence of methylated CpG islands in the promoter region of genes cansuppress their expression. This process may be due to the presence of5-methylcytosine, which apparently interferes with the binding oftranscription factors or other DNA-binding proteins to blocktranscription. In different types of tumors, aberrant or accidentalmethylation of CpG islands in the promoter region has been observed formany cancer-related genes, resulting in the silencing of theirexpression. Such genes include tumor suppressor genes, genes thatsuppress metastasis and angiogenesis, and genes that repair DNA(Momparler and Bovenzi (2000) J. Cell Physiol. 183:145-54).

[0182] Methylation of proteins is a post-translational modificationwhich can regulate the activity and subcellular localization of numerousproteins. Methylation of proteins can play an important role in proteinrepair and reversal of protein aging. Proteins undergo a variety ofspontaneous degradation processes, including oxidation, glycation,deamidation, isomerization, and racemization. These non-enzymaticmodifications can produce functionally damaged species that reflect theaction of aging at the molecular level (Stadtman (1992) Science257:1220-1224; Martin et al. (1996) Nat. Genet. 13:25-34). Methylationof these damaged proteins e.g., by protein L-isoaspartylmethyltransferase (Shimizu et al. (2000) Arch. Biochem. Biophys.381:225-34) can play a part in the repair pathway. Protein methylationis also known to be important in cellular stress responses (Desrosiersand Tanguay (1988) J. Biol. Chem. 263:4686-4692). Moreover, proteinmethyltransferases have recently been demonstrated to be important incellular signaling events, for example, in receptor-mediated and/ordifferentiation-dependent signaling (Lin et al. (1996) J. Biol. Chem.271:15034-15044; Abramovich et al. (1997) EMBO J. 16:260266).

[0183] Methylation is a process important for the catabolism of smallmolecules, such as thiol compounds and neurotransmitters. A deficiencyin thiol compound detoxification by methylation is being investigatedfor its role in rheumatoid arthritis (Waring and Emery (1993) BaillieresClin. Rheumatol. 6:337-50). Inhibition of dopamine methylation andinactivation by catechol-O-methyl transferase is a goal for therapy ofParkinson's disease (Goldstein and Lieberman (1992) Neurology42(suppl):8-12).

[0184] As used herein, the term “transferase” includes a protein orpolypeptide which is capable of catalyzing the transfer of a moleculargroup from a donor molecule to an acceptor molecule. In order tocatalyze molecular group transfer, the transferases must recognize orbind the group's donor then catalyze the transfer of the group to anacceptor molecule. In the process, the transferase itself can become anintermediate acceptor molecule, e.g., the alkylation of an active sitecysteine in O(6)-alkylguanine-DNA alkyltransferase (Daniels and Tainer(2000) Mutat. Res. 460:151-163). Members of a transferase family ofproteins typically are cytoplasmic or nuclear proteins. Transferases,e.g. methyltransferases typically include conserved motifs, including atleast one Prosite methyltransferase signature sequence, e.g. PS01261,PS00092, or PS01184. The 25501 molecules of the invention includeregions homologous to these motifs.

[0185] A 25501 polypeptide can include a “transfer domain” or a regionhomologous with a “transfer domain”.

[0186] As used herein, the term “transfer domain” includes an amino acidsequence of about 50 to 250 amino acid residues in length and includesone, two, preferably three sequences homologous to the Prosite methylaseor methyltransferase signature sequences PS01261, PS00092, and PS01184.Preferably, a transfer domain includes at least about 100 to 200 aminoacids, more preferably about 120 to 150 amino acid residues, or about130 to 140 amino acids and includes one, two, preferably three sequenceshomologous to Prosite methylase or methyltransferase signature sequencesPS01261, PS00092, and PS01184. Preferably the Prosite sequences arearranged in the following order, first the PS01261, second the PS00092,third the PS01184 and are spaced about sixty amino acids or less fromeach other. Preferably a transfer domain catalyzes the transfer of agroup, e.g. a methyl group from a donor to an acceptor molecule. Thetransfer domain of 25501 can be found at about amino acid residues 280to 411 of SEQ ID NO:32.

[0187] A sequence similar to the Prosite sequence PS01261, the putativeRNA methylase family UPF0020 signature,D-P-[LIVMF]-C-G-[ST]-G-x(3)-[LI]-E (SEQ ID NO:36) can be found in human25501 at about amino acid residues 304 to 315 of SEQ ID NO:32, except anL replaces the [ST]. A sequence similar to the Prosite sequence PS00092,the N-6 adenine-specific DNA methylase signature,[LIVMAC]-[LIVFYWA]-x-[DN]-P-P[FYW] (SEQ ID NO:37) can be found in human25501 at about amino acid residues 371 to 377 of SEQ ID NO:32, except anI replaces the first P. A sequence similar to the Prosite sequencePS01184, the ubiE/COQ5 methyltransferase family signature 2,R-V-[LIVM]-K[PV]-[GM]-G-x-[LIVMF]-x(2)-[LIVM]-E-x-S (SEQ ID NO:38) canbe found in human 25501 at about amino acid residues 396 to 409 of SEQID NO:32, except an H replaces the K and the last three residues areL-S-E instead of E-x-S. In the above conserved signature sequences, andother motifs or signature sequences described herein, the standard IUPACone-letter code for the amino acids is used. Each element in the patternis separated by a dash (-); square brackets ([ ]) indicate theparticular residues that are accepted at that position; x indicates thatany residue is accepted at that position; and numbers in parentheses (()) indicate the number of residues represented by the accompanying aminoacid.

[0188] The transfer domain of the human 25501 protein is homologous,e.g., at least about 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%,36%, 37%, 38%, 39%, 40%, or 41% identical to the ProDom family PD034341(“VNG2242C Y71F9AL.1 MTH724 PH0338 AF1257 MJ0710 APE1835”) domain(ProDomain Release 2001.1). The ProDom PD034341 domain and can includeone, two, preferably three Prosite methylase or methyltransferasesignature sequences or sequences homologous to these sequences spacedsixty amino acids or less apart. A GAP alignment of the transfer domain(amino acids 280 to 411 of SEQ ID NO:32) of human 25501 with amino acidresidues 1 to 133 of the 172 amino acid PD034341 domain consensussequence (SEQ ID NO:34), derived from a BLAST search model results in32% identity (as calculated from the blosum62 matrix).

[0189] In a preferred embodiment, a 25501 polypeptide or protein has a“transfer domain” or a region which includes at least about 100 to 200more preferably about 120 to 150 or 130 to 140 amino acid residues andhas at least about 60%, 70% 80% 90% 95%, 99%, or 100% homology with a“transfer domain,” e.g., the transfer domain of human 25501 (e.g.,residues 280 to 411 of SEQ ID NO:32).

[0190] Regions similar to the transfer domain are found in otherproteins. For example, a transfer domain can be found in MGC:2454 (SEQID NO:35, accession number 13278783 in GenPept; corresponding to numberBC004163 in GenBank). MGC:2454 is homologous to the 25501 protein in SEQID NO:32. An alignment of the 25501 protein with MGC:2454 results inabout 94% overall sequence identity between the two sequences. Sequenceidentity of 100% can be found in regions beginning about amino acid 1 to473 of MGC:2454 (SEQ ID NO:35) with amino acids about 31 to 503 of25501, SEQ ID NO:32 (as calculated in matblas from the blosum62.iijmatrix).

[0191] To make the determination that the “transfer” domain in a 25501protein sequence or a polypeptide or protein of interest has aparticular profile, the amino acid sequence of the protein can besearched against a database of domains, e.g., the ProDom database(Corpet et al. (1999), Nucl. Acids Res. 27:263-267). The ProDom proteindomain database consists of an automatic compilation of homologousdomains. Current versions of ProDom are built using recursive PSI-BLASTsearches (Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402; Gouzyet al. (1999) Computers and Chemistry 23:333-340) of the SWISS-PROT 38and TREMBL protein databases. The database automatically generates aconsensus sequence for each domain. A BLAST search was performed againstthe database resulting in the PD034341 profile of the “transfer” domainin the amino acid sequence of human 25501 at about residues 280 to 411of SEQ ID NO:32.

[0192] A 25501 molecule can further include a recognition/binding domainor regions homologous with a “recognition/binding domain.” As usedherein, the recognition/binding domain” includes an amino acid sequenceof about 100 to 350 amino acid residues in length and whose secondarystructure is characterized by a high alpha helical content. Table 14,below, illustrates the prediction of the likelihood of amino acidresidues from this region of 25501 to belong to an element of secondarystructure by two prediction methods. TABLE 14 Secondary StructurePrediction of Amino Acid Residues 117 to 198 of SEQ ID NO: 32 A B C D 30V H H 31 M H H 32 R H H 33 E H H 34 V H H 35 R H H 36 A H H 37 R H H 38L H H 39 A H H 40 A H H 41 T H H 42 Q H H 43 V H H 44 E H H 45 Y . H 46I . T 47 S t T 48 G t T 49 K . T 50 V . B 51 F . B 52 F . B 53 T . B 54T t B 55 C t H 56 S T H 57 D T H 58 L . H 59 N . H 60 M H H 61 L H H 62K H H 63 K H H 64 L H H 65 K H H 66 S H H 67 A H H 68 E H H 69 R B H 70L B H 71 F B H 72 L B H 73 L B H 74 I B H 75 K . H 76 K . H 77 Q B B 78F B B 79 P B B 80 L B B 81 I B B 82 I B B 83 S . B 84 S . . 85 V . . 86S . . 87 K h . 88 G h . 89 K h . 90 I h . 91 F h B 92 N h B 93 E h B 94M b B 95 Q b . 96 R b . 97 L b . 98 I b . 99 N b . 100 E t . 101 D t .102 P T T 103 G T T 104 S T T 105 W B . 106 L B . 107 N B . 108 A B .109 I B . 110 S B . 111 I B . 112 W B H 113 K B H 114 N B H 115 L B H116 L B H 117 E H H 118 L H H 119 D H H 120 A H H 121 K H H 122 K H H123 E H H 124 K H H 125 L H H 126 S H H 127 Q H H 128 R H H 129 D t H130 D T H 131 N T H 132 Q H H 133 L H . 134 K H . 135 R H . 136 K H .137 V H . 138 G H H 139 E H H 140 N H H 141 E H H 142 I H H 143 I H H144 A H H 145 K H H 146 K H H 147 L H H 148 K H H 149 I H H 150 E H H151 Q H H 152 M H H 153 Q H H 154 K H H 155 I H H 156 E H H 157 E H H158 N . H 159 R T H 160 D T H 161 C t H 162 Q H H 163 L H H 164 E H H165 K H H 166 Q H H 167 I H H 168 K H H 169 E H H 170 E H H 171 T H H172 L H H 173 E H H 174 Q H H 175 R H H 176 D H H 177 F H H 178 T H H179 T H . 180 K H . 181 S H . 182 E H . 183 K H H 184 F H H 185 Q H H186 E H H 187 E H H 188 E H H 189 F H H 190 Q t H 191 N t H 192 D H H193 I H H 194 E H H 195 K H H 196 A H H 197 I H H 198 D H H 199 T t .200 H t . 201 N t . 202 Q T T 203 N T T 204 D t T 205 L B T 206 T B T207 F B T 208 R B T 209 V B T 210 S . T 211 C t T 212 R T T 213 C T T214 S T T 215 G T T 216 T . T 217 I . . 218 G . . 219 K H . 220 A H .221 F H H 222 T H H 223 A H H 224 Q H H 225 E H H 226 V H H 227 G . H228 K . H 229 V . H 230 I . H 231 G . H 232 I h H 233 A h H 234 I h H235 M h H 236 K h H 237 H h H 238 F h T 239 G h T 240 W h T 241 K h .242 A h . 243 D h . 244 L h . 245 R t . 246 N t . 247 P t . 248 Q t .249 L . B 250 E . B #Osguthorpe-Robson secondary structure predictionfor that AA (Garnier et al. (1978) J. Mol. Biol. 120: 97-120).

[0193] As shown in Table 14, the prediction methods agree that themajority of residues in this region, in particular, residues 117 to 198of SEQ ID NO:32, can form alpha helices. Proteins can use alpha helicesto recognize and bind nucleic acid molecules. For example, thehelix-turn-helix DNA binding domain is involved in a variety ofprotein-DNA interactions (Wintjens and Rooman (1996) J. Mol. Biol.262:294-313), with variations in additional helices and helixarrangements distinguishing protein families from one another. Proteinscan use alpha helices to determine the specificity of ligandinteractions. For example, amino acid residues on helices in the ligandbinding pocket of steroid receptors allow the discrimination betweendifferent steroid hormones (Ekena et al. (1998) J. Biol. Chem.273:693-699).

[0194] In a preferred embodiment, a 25501 polypeptide or protein has a“recognition/binding domain” or a region which includes at least about150 to 300 more preferably about 180 to 260 or 210 to 230 amino acidresidues and has at least about 60%, 70% 80% 90% 95%, 99%, or 100%homology with a “recognition/binding domain,” e.g., therecognition/binding domain of human 25501 (e.g., residues 30 to 250 ofSEQ ID NO:32).

[0195] To identify the presence of a “recognition/binding” domain in a25501 protein sequence, and make the determination that a polypeptide orprotein of interest has a particular profile, the amino acid sequence ofthe protein can be analyzed by a secondary structure prediction methodthat predicts the secondary structure of proteins based on thecharacteristics of each amino acid (Chou and Fasman (1974) Biochemistry13:222-244 and Garnier et al. (1978) J. Mol. Biol. 120:97-120).

[0196] A 25501 family member can include at least one transfer domain. A25501 family member also can include at least one recognition/bindingdomain. Furthermore, a 25501 family member can include at least one,two, three, four, five preferably six protein kinase C phosphorylationsites (Prosite PS00005); at least one, two, three, four, five, six,seven, eight, nine and preferably ten casein kinase II phosphorylationsites (Prosite PS00006); at least one tyrosine kinase phosphorylationsite (Prosite PS00007); at least one cAMP/cGMP protein kinasephosphorylation sites (Prosite PS00004); at least one amidation site(Prosite PS00009); and at least one, two, three, four, five preferablysix N-myristoylation sites (Prosite PS00008).

[0197] Polypeptides of the invention include fragments which include:all or part of a hydrophobic sequence, e.g., the sequence from aboutamino acid 258 to 267, from about 353 to 363, and from about 100 to 108of SEQ ID NO:32; all or part of a hydrophilic sequence, e.g., thesequence from about amino acid 121 to 132, from about 150 to 160, andfrom about 410 to 423 of SEQ ID NO:32; a sequence which includes a Cys,or a glycosylation site.

[0198] As the 25501 polypeptides of the invention can modulate25501-mediated activities, they can be useful for developing noveldiagnostic and therapeutic agents for transferase-associated or other25501-associated disorders, as described below.

[0199] As used herein, a “transferase-associated activity” includes anactivity which involves a transfer function, e.g. the transfer of agroup, e.g. a methyl group from a donor molecule to an acceptormolecule. This function is implicated in a wide range of cellactivities, including, but not limited to cell growth and cellprocesses, e.g., the regulation of cell proliferation, differentiation,migration, protein transport, gene expression, and/or intra- orintercellular signaling, and apoptosis. Members of the family can play arole in cancer, developmental syndromes, such as Fragile X and Rett(El-Osta and Wolf (2000) Gene Expr. 9:63-75), neurodegenerativedisorders such as Alzheimer's disease (Shimizu et al. (2000) Arch.Biochem. Biophys. 381:225-34), and Parkinson's disease (Goldstein andLieberman (1992) Neurology 42 (suppl4):8-12), and inflammatory disorderssuch as rheumatoid arthritis (Waring and Emery (1992) Baillieres Clin.Rheumatol. 6:337-50).

[0200] As used herein, a “25501 activity”, “biological activity of25501” or “functional activity of 25501”, refers to an activity exertedby a 25501 protein, polypeptide or nucleic acid molecule on e.g., a25501-responsive cell or on a 25501 substrate, e.g., a proteinsubstrate, as determined in vivo or in vitro. In one embodiment, a 25501activity is a direct activity, such as an association with a 25501target molecule. A “target molecule” or “binding partner” is a moleculewith which a 25501 protein binds or interacts in nature. In an exemplaryembodiment, 25501 is a transferase, e.g., a methyltransferase, and thushas the ability to bind to, or interact with, a substrate or targetmolecule, e.g., a nucleic acid molecule (e.g. DNA or RNA), a smallorganic molecule (e.g., a hormone, a neurotransmitter or a coenzyme), ora protein; and/or the ability to transfer a group, e.g. a methyl groupfrom a donor to an acceptor molecule, e.g. the substrate or targetmolecule.

[0201] A 25501 activity can also be an indirect activity, e.g., acellular signaling activity mediated by interaction of the 25501 proteinwith a 25501 receptor. Based on the above-described sequence structuresand similarities to molecules of known function, the 25501 molecules ofthe present invention can have similar biological activities astransferase family members. For example, the 25501 proteins of thepresent invention can have one or more of the following activities: (1)the ability to interact with a 25501 substrate or target molecule (e.g.,a non-25501 protein); (2) the ability to convert a 25501 substrate ortarget molecule to a product (e.g., transfer of a methyl group to orfrom the substrate or target molecule); (3) the ability to interact withand/or methyl transfer to a second non-25501 target molecule e.g., anucleic acid molecule (e.g., DNA or RNA), a small organic molecule(e.g., a hormone, neurotransmitter or a coenzyme) or a protein; (4) theability to regulate substrate or target molecule activity; (5) theability to modulate intra- or intercellular signaling and/or genetranscription (e.g., either directly or indirectly); (6) the ability tomodulate cellular targeting and/or transport of proteins; (7) theability to modulate cellular proliferation, growth, or differentiation;(8) the ability to modulate cell migration and/or (9) the ability tomodulate apoptosis.

[0202] The 25501 molecules of the invention can modulate the activitiesof cells in tissues where they are expressed. For example, 25501 mRNA isexpressed in brain, in particular the astrocytes, which provide physicaland biochemical support for neurons and interact with capillaryendothelial cells to form the blood-brain barrier. 25501 mRNA also canbe found in the ovary and prostate epithelium. 25501 mRNA also isexpressed in tissues undergoing large amounts of growth, differentiationand angiogenesis such as fetal and neonatal kidney, fetal heart andfetal adrenal gland. 25501 mRNA also is expressed in cancerous tissue,especially malignant tumors, such as Wilm's tumor, lung tumor, colontumor, metastases of colon tumor in the liver, metastases of prostatetumor in the liver, metastases of breast tumors in the lung and brain.Accordingly, the 25501 molecules of the invention can act as noveldiagnostic targets or therapeutic agents for neurological disorders,ovarian disorders, prostate disorders, or proliferative and/ordifferentiative disorders or other transferase disorders.

[0203] Gene Expression Analysis of 25501 by TaqMan® Analysis

[0204] Human 25501 expression was measured by TaqMan® quantitative PCR(Perkin Elmer Applied Biosystems) in cDNA prepared from a variety ofnormal and diseased (e.g., cancerous) human tissues or cell lines.

[0205] The results indicate significant 25501 expression in brain, e.g.glial cells (e.g. a high level in astrocytes); a medium level in theovary; in the prostate e.g. a medium level in prostate epithelium; intissues undergoing large amounts of growth, differentiation andangiogenesis, e.g. medium levels in the fetus and neonate (e.g. fetaland neonatal kidney fetal heart and fetal adrenal gland); and incancerous tissue, e.g. tumors (e.g. medium levels in lung tumor, colontumor and metastases of colon tumor in the liver, and high levels inWilm's tumor and metastases of prostate tumor in the liver).

[0206] Transcriptional Profiling

[0207] The expression profiles of samples of metastatic brain and lungtumors originating from human breast adenocarcinoma tumors were comparedwith the profiles samples from primary human breast adenocarcinomatumors. Total RNA was isolated from the tissue samples. Reversetranscriptase was used to generate ³³P-dCTP-labeled cDNAs from the RNA.These experimental tissue cDNAs were hybridized to an array of moleculeswith known sequences. The nylon array contained 9600 elements, each witha PCR product from cDNA clones of the known genes. The hybridizationlevels from each tissue sample are measured and compared. Comparisonsresulting in at least a 1.5-fold difference were judged as significant.The 25501 transcript was identified as being upregulated in the lung andbrain metastatic tumors originating from human breast adenocarcinomatumors.

[0208] Human 17903

[0209] The present invention is based, at least In part, on thediscovery of a novel aminopeptidase referred to herein as “17903”. Thepresent invention provides the human 17903 sequence (SEQ ID NO:39),which is approximately 3034 nucleotides long including untranslatedregions, contains a predicted methionine-initiated coding sequence ofabout 2178 nucleotides (nucleotides 18 to 2195 of SEQ ID NO:39; SEQ IDNO:41). The coding sequence encodes a 725 amino acid protein (SEQ IDNO:40).

[0210] The 17903 protein includes a Pfam Peptidase family M1 consensusdomain, as well as Prodom consensus domains for aminopeptidases. Forgeneral information regarding PFAM identifiers, PS prefix and PF prefixdomain identification numbers, refer to Sonnhammer et al. (1997) Protein28:405-420.

[0211] The 17903 protein contains a significant number of structuralcharacteristics in common with members of the aminopeptidase M1 familyof metallopeptidases. Aminopeptidases (APs) are a group of widelydistributed exopeptidases that catalyze the hydrolysis of amino acidresidues from the amino-terminus of polypeptides and proteins. Theenzymes are found in plant and animal tissues, in eukaryotes andprokaryotes, and in secreted and soluble forms. Biological functions ofaminopeptidases include protein maturation, terminal degradation ofproteins, hormone level regulation, and cell-cycle control.

[0212] Aminopeptidases are implicated in a host of conditions anddisorders including aging, cancers, inflammatory diseases, cataracts,cystic fibrosis and leukemias. In eukaryotes, APs are associated withremoval of the initiator methionine. In prokaryotes the methionine isremoved by methionine aminopeptidase subsequent to removal of theN-formyl group from the initiator N-formyl methionine, facilitatingsubsequent modifications such as N-acetylation and N-myristoylation. InE. coli AP-A (pepA), the xerB gene product is required for stabilizationof unstable plasmid multimers.

[0213] APs are also involved in the metabolism of secreted regulatorymolecules, such as hormones and neurotransmitters, and modulation ofcell-cell interactions. In mammalian cells and tissues, the enzymes areapparently required for terminal stages of protein degradation, andEGF-induced cell-cycle control; and may have a role in protein turnoverand selective elimination of obsolete or defective proteins.Furthermore, the enzymes are implicated in the supply of amino acids andenergy during starvation and/or differentiation, and degradation oftransported exogenous peptides to amino acids for nutrition. APs mayalso have a role in inflammation. Industrial uses of the enzymes includemodification of amino termini in recombinantly expressed proteins. SeeA. Taylor (1993) TIBS 18:1993:167-172.

[0214] Aminopeptidases have been identified in a wide variety of tissuesand organisms, including zinc aminopeptidase and aminopeptidase M fromrat kidney membrane; human aminopeptidase N from intestine; arginineaminopeptidase from liver; aminopeptidase N^(b) from muscle;leukotriene-A4 hydrolase; leucine aminopeptidase (LAP) from bovine andhog lens and kidney; aminopeptidase A (xerB gene product) from E. coli;yscl APE1/LAP4 and aminopeptidase A (pep4 gene product) from S.cerevisiae; LAP from aeromonas; dipeptidase from mouse ascites;methionine aminopeptidase from salmonella, E. coli, S. cerevisiae andhog liver; and D-amino acid aminopeptidase from ochrobactrum anthropiSCRC C1-38.

[0215] As used herein, the term “aminopeptidase” refers to a protein orpolypeptide that is capable of catalyzing the cleavage of a polypeptidebond at the amino terminus of a polypeptide molecule through hydrolysis(i.e., possessing amino-terminal polypeptide hydrolytic activity orexopeptidase activity). As referred to herein, aminopeptidasespreferably include a catalytic domain of about 150-350 amino acidresidues in length, preferably 200-300 amino acid residues in length, ormore preferably 220-280 amino acids in length. Based on the sequencesimilarities described above, the 17903 molecules of the presentinvention are predicted to have similar biological activities asaminopeptidase family members.

[0216] As the biological functions of aminopeptidases include proteinmaturation and protein degradation, they typically play a role indiverse cellular processes. In particular, aminopeptidases have beenshown to have a role in tumor growth, metastasis, and angiogenesis; ininflammatory disorders including, but not limited to osteoarthritis andrheumatoid arthritis, multiple sclerosis, Crohn disease, psoriasis,periodontal disease, and asthma; in cataracts; in cystic fibrosis; inleukemias; and in aging.

[0217] A 17903 polypeptide can include an “aminopeptidase zinc-bindingmotif” or regions homologous with the “Peptidase M1 family ofaminopeptidases”.

[0218] As used herein, the term “Peptidase M1 family of aminopeptidasesdomain” includes an amino acid sequence having a bit score for thealignment of the sequence to the Peptidase M1 family domain (M) of atleast 8. Preferably, a peptidase M1 family of aminopeptidases domainincludes at least about 150-350 amino acids, more preferably 200-300amino acids, or about 220-280 amino acids and has a bit score for thealignment of the sequence to the aminopeptidase domain (HMM) of at least16 or greater. The Peptidase M1 family (HMM) has been assigned the PFAMAccession PF01433. An alignment of the Peptidase M1 family ofaminopeptidases domain of human 17903 (amino acids 195 to 445 of SEQ IDNO:40) with the consensus amino acid sequences derived from a hiddenMarkov model yields a bit score for the alignment of the sequence to theamino-peptidase domain (HMM) of 172 (E=4.3e−59). The identifiedconsensus amino acid sequence for the Peptidase M1 family ofaminopeptidases is depicted in SEQ ID NO:42.

[0219] In a preferred embodiment 17903 polypeptide or protein has a“peptidase M1 family of aminopeptidases domain” or a region whichincludes at least about 60%, 70%, 80%, 90%, 95%, 99%, or 100% homologywith the Peptidase M1 family of aminopeptidases (e.g., amino acidresidues 195 to 445 of SEQ ID NO:40).

[0220] To identify the presence of a Peptidase M1 aminopeptidase regionof homology in a 17903 protein sequence, and make the determination thata polypeptide or protein of interest has a particular profile, the aminoacid sequence of the protein can be searched against a database of HMMs(e.g., the Pfam database, release 2.1) using the default parameters. Forexample, the hmmsf program, which is available as part of the HMMERpackage of search programs, is a family specific default program forMILPAT0063 and a score of 15 is the default threshold score fordetermining a hit. Alternatively, the threshold score for determining ahit can be lowered (e.g., to 8 bits). A description of the Pfam databasecan be found in Sonhammer et al. (1997) Proteins 28(3):405-420 and adetailed description of HMMs can be found, for example, in Gribskov etal. (1990) Meth. Enzymol. 183:146-159; Gribskov et al. (1987) Proc.Natl. Acad. Sci. USA 84:4355-4358; Krogh et al. (1994) J. Mol. Biol.235:1501-1531; and Stultz et al. (1993) Protein Sci. 2:305-314, thecontents of which are incorporated herein by reference.

[0221] As the 17903 polypeptides of the invention may modulate17903-mediated activities, they may be useful for developing noveldiagnostic and therapeutic agents for 17903-mediated or relateddisorders, as described below.

[0222] As used herein, a “17903 activity”, “biological activity of17903” or “functional activity of 17903”, refers to an activity exertedby a 17903 protein, polypeptide or nucleic acid molecule on e.g., a17903-responsive cell or on a 17903 polypeptide substrate, as determinedin vivo or in vitro. In one embodiment, a 17903 activity is a directactivity, such as an association with a 17903 target molecule. A “targetmolecule” or “binding partner” or “ligand” or “substrate” is a moleculewith which a 17903 protein binds or interacts in nature, e.g., apolypeptide that a 17903 protein cleaves. A 17903 activity can also bean indirect activity, e.g., a cellular signaling activity mediated byinteraction of the 17903 protein with a 17903 ligand. For example, the17903 proteins of the present invention can have one or more of thefollowing activities: 1) cleavage of a protein precursor to maturation;2) catalysis of protein degradation; 3) regulation of hormone levels; 4)modulation of tumor cell growth and invasion; 5) modulation ofangiogenesis; and 6) regulation of cell proliferation.

[0223] Polypeptides of the invention include fragments which include:all or a part of a hydrophobic sequence, e.g. residues from about 317 to352 of SEQ ID NO:40; or all or part of a hydrophilic fragment, e.g.residues from about 676 to 704 of SEQ ID NO:40. Other fragments includea cysteine residue or an N-glycosylation site.

[0224] The expression profile for 17903 is depicted in Tables 15-29below. As depicted in tables 15-29, 17903 is up-regulated inproliferating endothelial cells compared to arrested endothelial cellsin 5 out of 5 independent experiments. 17903 is further upregulated insome lung, breast, ovary, and brain tumors as compared to normaltissues. 17903 is expressed in hemanginomas and the expression levels inhemanginomas are 30-50 fold higher than the expression level in normalskin. In addition, 17903 is expressed in other angiogenic tissues suchas Wilms tumors, uterine adenocarcinoma, neuroblastoma, fetal adrenalgland, and fetal kidney. Mouse 17903 is up-regulated in VEGF plugs ascompared to parental plugs in the xenograft model. In the RIP-Taq mousemodel, the expression of 17903 is up-regulated in tumor islets and theexpression levels of 17903 correlate to the expression levels of VEGF atvarious stages of tumor development.

[0225] Expression of 17903 was measured in various clinical samples byin situ hybridization. 17903 was weakly expressed in one of two breasttumor epithelial cell samples, but not in either of two normal breastsamples. Three of four primary colon tumor and metastases were positivefor 17903 expression, while 17903 was not detected in the normal coloncontrol. 17903 was expressed in five of seven samples of malignantepithelium of several histologically different lung tumor subtypes, butwas not detected in the normal lung control sample. 17903 was expressedin both malignant ovary epithelium and normal stroma of the ovary.

[0226] The methods of the present invention are most relevant to thosenormal and diseased tissues where 17903 is expressed, including thetissues described above as well as those shown in Tables 15-29 below.The expression pattern of 17903 in human samples and mouse modelssuggest that 17903 plays a positive role in cellular proliferation(including endothelial proliferation), tumor angiogenesis, and/ortumorogenesis. Accordingly, inhibition of 17903 function may inhibittumor angiogenesis and tumor growth.

[0227] Identification and Characterization of Human 17903 cDNAs

[0228] The human 17903 sequence (SEQ ID NO:39), which is approximately3034 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 2175 nucleotides(nucleotides 18-2192 of SEQ ID NO:39; SEQ ID NO:41). The coding sequenceencodes a 725 amino acid protein (SEQ ID NO:40).

[0229] Tissue Distribution of 17903 mRNA

[0230] The expression of 17903 was monitored in various tissues and celltypes by quantitative PCR (TaqMan® brand quantitative PCR kit, AppliedBiosystems) according to the kit manufacture's instructions. The resultsare shown below in Tables 15-29. TABLE 15 EXPRESSION OF 17903 IN HUMANANGIOGENESIS-RELATED TISSUES Average Average Relative Tissue Type Beta 217903.1 Beta 2 Δ Ct Expression Hemangioma 31.84 19.89 11.95 0.25Hemangioma 26.23 19.04 7.19 6.87 Hemangioma 26.06 19.46 6.60 10.34Normal Kidney 28.12 21.52 6.60 10.34 Renal Cell Carcinoma 30.00 20.569.44 1.44 Wilms Tumor 25.85 19.26 6.59 10.38 Wilms Tumor 29.70 22.667.04 7.63 Skin 34.65 22.36 12.29 0.20 Uterine 27.03 19.34 7.69 4.86Adenocarcinoma Neuroblastoma 27.29 20.11 7.18 6.90 Fetal Adrenal 26.8418.41 8.43 2.90 Fetal Kidney 27.67 20.97 6.70 9.62 Fetal Heart 24.9018.62 6.28 12.87 Normal Heart 25.72 19.66 6.06 14.99 Cartilage 34.8924.99 9.91 1.04 Spinal cord 28.12 20.78 7.34 6.17 lymphangiona 33.1924.61 8.58 2.62 Endometrial polyps 36.06 26.25 9.81 1.11 Synovium (RA)31.25 23.11 8.14 3.56 Hyperkeratotic skin 30.30 23.43 6.87 8.55

[0231] TABLE 16 EXPRESSION OF 17903 IN HUMAN CINICAL SAMPLES β 2 TissueType Mean Mean δδCt Expression PIT 400 Normal Breast 26.68 17.14 9.541.3387 PIT 372 Normal Breast 29.3 19 10.3 0.7932 PIT 56 Normal Breast28.57 21.13 7.45 5.7389 MDA 106 Breast Tumor 27.55 19.31 8.24 3.2962 MDA234 Breast Tumor 25.16 16.48 8.68 2.4466 NDR 57 Breast Tumor 27.16 17.859.31 1.5755 MDA 304 Breast Tumor 26.73 17.83 8.89 2.1006 NDR 58 BreastTumor 23.63 16.23 7.41 5.9003 NDR 132 Breast Tumor 26.78 20.02 6.769.2265 NDR 07 Breast Tumor 27.77 18.02 9.75 1.1613 NDR 12 Breast Tumor26.34 20.47 5.88 16.9802 PIT 208 Normal Ovary 27.2 17.52 9.68 1.2233 CHT620 Normal Ovary 27.32 18.02 9.3 1.5809 CHT 619 Normal Ovary 27.14 18.458.69 2.4297 CLN 03 Ovary Tumor 28.11 18.25 9.87 1.0724 CLN 05 OvaryTumor 26.31 17.47 8.84 2.1822 CLN 17 Ovary Tumor 25.59 18.63 6.96 8.0321CLN 07 Ovary Tumor 27.99 17.67 10.32 0.7823 CLN 08 Ovary Tumor 27.5917.21 10.38 0.7504 MDA 216 Ovary Tumor 28.65 19.07 9.58 1.3066 CLN012Ovary Tumor 26.43 19.65 6.79 9.068 MDA 25 Ovary Tumor 26.41 20.196.21 13.4617 MDA 183 Normal Lung 25.23 16.56 8.68 2.4466 CLN 930 NormalLung 28.5 19.3 9.21 1.6944 MDA 185 Normal Lung 26.71 18.07 8.64 2.5067CHT 816 Normal Lung 27.49 17.39 10.1 0.9112 MPI 215 Lung Tumor-SmC 24.817.68 7.11 7.239 MDA 259 Lung Tumor-PDNSCCL 25.04 18.2 6.84 8.6986 CHT832 Lung Tumor-PDNSCCL 25.27 17.48 7.78 4.5497 MDA 253 LungTumor-PDNSCCL 25.34 17.02 8.31 3.14 CHT 814 Lung Tumor-SCC 23.27 15.997.28 6.4566 CHT 793 Lung Tumor-ACA (?) 25.35 17.2 8.15 3.5205 MDA 262Lung Tumor-SCC 27.22 21.73 5.5 22.1738 CHT 211 Lung Tumor-AC 26.22 18.327.9 4.1866 Normal Human Bronchial 24.2 18.84 5.37 24.2647 Epithelium

[0232] TABLE 17 17903 EXPRESSION IN HUMAN CLINICAL SAMPLES β 2 TissueType Mean Mean δδCt Expression CHT 523 Normal Colon 25.38 18.17 7.216.78 NDR 104 Normal Colon 23.93 18.02 5.91 16.69 CHT 416 Normal Colon26.73 19.02 7.71 4.78 CHT 452 Normal Colon 26.41 17.18 9.22 1.67 NDR 210Colon Tumor 28.69 22.56 6.13 14.23 CHT 398 Colon Tumor 23.16 18.59 4.5841.96 CHT 382 Colon Tumor 29.18 20.66 8.53 2.71 CHT 944 Colon Tumor 24.917.86 7.04 7.63 CHT 528 Colon Tumor 22.86 17.67 5.2 27.30 CHT 368 ColonTumor 23.56 16.59 6.96 8.03 CHT 372 Colon Tumor 25.14 18.64 6.5 11.05CLN 609 Colon Tumor 24.39 18.32 6.07 14.94 CHT 01 Colon Cancer Liver23.82 17.49 6.33 12.43 Metastases CHT 3 Colon Cancer Liver 26.32 20 6.3212.52 Metastases CHT 340Colon Cancer Liver 25.29 19.77 5.53 21.72Metastases NDR 217Colon Cancer Liver 25.84 18.05 7.79 4.52 MetastasesPit 260 Normal Liver 25.15 16.5 8.65 2.49 CHT 320 Normal Liver 27.9821.43 6.55 10.67 A4 Arresting Human 22.56 17.45 5.11 29.06 MicrovascularEndothelial Cells HMVEC-Arr C48 Proliferating Human 24.07 19.65 4.4346.39 Microvascualr Endothelial Cells CHT 50 Placenta 30.29 24.45 5.8417.40 ONC 102 Hemangioma 25.95 18.4 7.55 5.32

[0233] TABLE 18 EXPRESSION OF MOUSE 17903 IN MOUSE TUMOR ANGIOGENICTISSUES β 2 Tissue Type Mean Mean δδCt Expression RIP Angio 25.49 17.537.96 4.0161 RIP Tumor 25.77 18.17 7.61 5.1365 Xeno Parent 1 26.07 17.228.86 2.1596 Xeno Parent 2 27.75 16.26 11.48 0.3489 Xeno VEGF 1 27.9317.58 10.35 0.7689 Xeno VEGF 2 26.34 15.99 10.35 0.7662 Spleen 22.2515.97 6.29 12.8241 Heart 20.98 12.94 8.04 3.7994 Kidney 21.9 14.26 7.645.0134 Colon 22.23 16.34 5.89 16.8046 VEGF 1 27.1 19.11 7.99 3.9334 VEGF2 26.56 17.22 9.34 1.543 P1 26.39 16.74 9.64 1.249 P2 27.45 17.26 10.20.8531

[0234] TABLE 19 EXPRESSION OF 17903 IN XENOGRAFT CELL LINES β 2 TissueType Mean Mean δδCt Expression MCF-7 Breast Tumor 23.25 18.67 4.58 41.96ZR75 Breast Tumor 24.02 21.18 2.85 138.70 T47D Breast Tumor 23.55 18.864.68 38.88 MDA 231 Breast Tumor 23.59 17.86 5.74 18.71 MDA 435 BreastTumor 22.97 17.66 5.3 25.30 SKBr3 Breast 25.13 20.4 4.74 37.55 DLD 1Colon Tumor (stageC) 22.07 20.7 1.37 388.23 SW480 Colon Tumor (stage B)25.62 21.55 4.08 59.33 SW620 Colon Tumor (stageC) 22.59 18.91 3.68 78.02HCT116 25.93 22.16 3.77 73.30 HT29 22.34 17.55 4.79 36.27 Colo 205 22.1116.36 5.75 18.58 NCIH125 22.97 20.02 2.94 129.86 NCIH67 25.41 20.88 4.5343.43 NCIH322 24.07 21.07 3 124.57 NCIH460 24.22 19.88 4.34 49.55 A54924.65 21.9 2.75 149.17 NHBE 24.96 21.27 3.69 77.75 SKOV-3 ovary 22.6817.74 4.93 32.69 OVCAR-3 ovary 25.09 21.07 4.02 61.64 293 Baby Kidney24.31 21.11 3.2 108.82 293T Baby Kidney 25.39 22.84 2.55 170.76

[0235] TABLE 20 EXPRESSION OF 17903 IN HUMAN TISSUES Tissue Mean 18SMean δCt Expression Adrenal Gland 28.20 14.33 13.87 0.07 Brain 28.0713.48 14.59 0.04 Heart 27.32 14.34 12.98 0.12 Kidney 26.85 14.36 12.490.17 Liver 28.62 14.24 14.39 0.05 Lung 27.26 15.30 11.96 0.25 Mammary27.10 14.42 12.68 0.15 Gland Pancreas 28.73 16.08 12.65 0.16 Placenta27.88 15.70 12.18 0.22 Prostate 28.35 14.94 13.41 0.09 Salivary Gland28.28 14.88 13.40 0.09 Muscle 27.77 14.89 12.89 0.13 Sm. Intestine 28.1215.02 13.10 0.11 Spleen 27.48 14.91 12.57 0.17 Stomach 27.85 14.68 13.170.11 TesteS 27.58 14.36 13.22 0.10 Thymus 27.45 14.09 13.36 0.10 Trachea27.96 15.05 12.91 0.13 Uterus 28.78 14.81 13.97 0.06 Spinal Cord 28.3214.90 13.42 0.09 Skin 28.63 15.20 13.43 0.09 DRG 29.80 15.56 14.24 0.05

[0236] TABLE 21 EXPRESSION OF 17903 IN HUMAN TISSUES β2M803 Tissue MeanMean δCt Expression Adrenal Gland 23.19 18.53 4.66 39.55 Brain 23.0720.14 2.93 131.21 Heart 22.88 19.15 3.73 75.36 Kidney 21.43 18.06 3.3796.72 Liver 24.14 19.08 5.07 29.87 Lung 22.68 16.82 5.87 17.16 Mammary21.68 17.30 4.39 47.86 Gland Placenta 22.03 18.37 3.67 78.84 Prostate22.48 17.68 4.80 35.90 Salivary Gland 22.96 18.73 4.23 53.29 Muscle22.20 20.53 1.68 313.17 Sm. Intestine 22.62 18.38 4.24 52.92 Spleen21.68 16.44 5.25 26.37 Stomach 22.56 18.04 4.52 43.74 Teste 22.13 19.602.53 173.14 Thymus 22.54 18.10 4.45 45.91 Trachea 22.97 19.05 3.92 66.29Uterus 24.06 18.30 5.76 18.45 Spinal Cord 23.07 18.84 4.24 53.11 Skin23.87 16.99 6.88 8.49 DRG 25.21 18.80 6.42 11.72

[0237] TABLE 22 EXPRESSION OF 17903 IN HUMAN CARDIOVASCULAR TISSUE β 2Tissue Type Mean Mean δδCt Expression Fetal Heart/normal/BWH 4 23.0817.07 6.01 15.5171 Heart/Normal/Atrium/MPI 1097 25.21 19.23 5.99 15.7883Heart/Normal/Atrium/PIT 277 22.35 15.49 6.86 8.6086Heart/Normal/Ventricle/PIT 272 22.84 16.3 6.54 10.7464Heart/Normal/Ventricle/TLO 1 26.04 19.27 6.76 9.1946Heart/Normal/Ventricle/PIT 278 23.18 16.45 6.74 9.3553Heart/Normal/Ventricle/PIT 204 21.68 16.52 5.17 27.8728Heart/Normal/Ventricle/PIT 205 22.45 16.54 5.91 16.6308Heart/Diseased/Ventricle/ELI 5 21.12 15.66 5.46 22.7183Heart/Diseased/Ventricle/PIT 16 23.21 16.16 7.04 7.5726Kidney/normal/NDR 171 27.46 19.68 7.78 4.5497 Kidney/normal/NDR 17924.32 16.8 7.53 5.4294 Kidney/normal/PIT 289 27.23 19.93 7.29 6.3678Kidney/normal/PIT 351 26.25 17.52 8.73 2.3551 Kidney/normal/PIT 35327.18 17.36 9.82 1.1063 Kidney/HT/NDR 233 26.54 18.21 8.32 3.1184Kidney/HT/NDR 224 24.46 16.36 8.1 3.6447 Kidney/HT/NDR 248 25.91 17.987.93 4.0863 Skeletal Muscle/Normal/MPI 27.16 18.07 9.09 1.8414 570Skeletal Muscle/Normal/PIT 284 26.36 19.13 7.24 6.6382 Liver/Normal/MPI155 29.1 15.64 13.46 0.0887 Liver/Normal/MPI 146 23.77 16.11 7.66 4.9615

[0238] TABLE 23 17903 EXPRESSION IN NORMAL HUMAN TISSUES Relative TissueType Expression Prostate 7.2 Prostate 16.5 Liver 3.7 Liver 18.4 Breast3.9 Breast 17.8 Skeletal Mucsle 11.4 Skeletal Mucsle 48.0 Brain 44.9Brain 10.7 Colon 8.6 Colon 8.2 Heart 35.0 Heart 11.1 Ovary 2.0 Ovary 1.0Kidney 6.3 Kidney 8.5 Lung 8.3 Lung 5.1 Vein 6.0 Vein 2.9 Aorta 13.3Testis 20.1 Testis 6.8 Thyroid 10.4 Thyroid 7.6 Placenta 5.6 Placenta6.0 Fetal Kidney 10.0 Fetal Kidney 70.0 Fetal Liver 9.1 Fetal Liver 38.6Fetal heart 29.3 Fetal heart 2.2 Osteoblasts (undif.) 14.0 Osteoblasts(dif.) 8.4 Small Intestine 5.6 Cervix 1.4 Spleen 4.0 Esoghagus 1.3Thymus 5.4 Tonsil 8.9 Lymphnote 10.2

[0239] TABLE 24 EXPRESSION OF 17903 IN HUMAN TISSUES β2 Tissue Type MeanMean δδCt Expression Artery normal 31.77 22 9.77 1.1493 Vein normal30.97 20.05 10.91 0.5179 Aortic Smooth Muscle Cells 24.32 19.65 4.6839.0103 (SMC) EARLY Coronary SMC 25.4 21.81 3.59 83.0429 Static HUVEC23.84 20.57 3.27 103.3063 Shear HUVEC 23.43 20.75 2.67 156.5831 Heartnormal 23.7 18.79 4.92 33.0318 Heart CHF 23.23 19.11 4.13 57.3128 Kidney24.99 20.45 4.54 42.837 Skeletal Muscle 25.81 21.19 4.62 40.6669 Adiposenormal 24.99 19.39 5.61 20.546 Pancreas 25.39 21.57 3.82 70.8052 primaryosteoblasts 24.99 19.22 5.78 18.2621 Osteoclasts (diff) 24.43 17.65 6.789.0995 Skin normal 26.47 21.09 5.38 24.097 Spinal cord normal 25.5219.83 5.68 19.4377 Brain Cortex normal 25.04 21.11 3.92 65.8351 BrainHypothalamus normal 26.26 21.02 5.24 26.4608 Nerve 30.57 24.23 6.3412.3444 DRG (Dorsal Root 27.47 21.82 5.66 19.8461 Ganglion) Glial Cells(Astrocytes) 26.15 22.12 4.03 61.2138 Glioblastoma 23.82 18.09 5.7318.8407 Breast normal 26.73 20.53 6.2 13.6024 Breast tumor 23.97 18.275.7 19.3034 Ovary normal 26.52 20.1 6.42 11.6785 Ovary Tumor 28.26 20.028.24 3.3076 Prostate Normal 25.3 19.53 5.76 18.3892 Prostate Tumor 23.7117.86 5.86 17.277 Epithelial Cells (Prostate) 25.22 21.23 3.99 62.9347Colon normal 24.2 18.15 6.05 15.0928 Colon Tumor 23.48 18.85 4.6340.2463 Lung normal 26.18 18.38 7.8 4.4716 Lung tumor 24.02 18.56 5.4622.7183 Lung chronic obstructive 24.15 18.48 5.67 19.5729 pulmonarydisease Colon IBD 24.32 18.11 6.21 13.5084 Liver normal 26.19 20.11 6.0814.7822 Liver fibrosis 26.9 21.74 5.16 28.0666 Dermal Cells-fibroblasts24.2 19.41 4.79 36.0214 Spleen normal 25.63 19.55 6.08 14.8335 Tonsilnormal 22.82 17.23 5.6 20.6173 Lymph node 24.29 18.74 5.55 21.3444 Smallintestine 26.07 19.71 6.36 12.2167 Skin-Decubitus 25.95 20.74 5.2127.1106 Synovium 27.08 20.53 6.55 10.6722 BM-MNC (Bone marrow 21.7 17.054.66 39.6922 mononuclear cells) Activated PBMC 23.09 16.14 6.95 8.088

[0240] TABLE 25 EXPRESSION OF 17903 IN HUMAN VESSEL TISSUES β2 TissueType Mean Mean δδCt Expression Aortic SMC (Early) 26.27 20.98 5.29 25.65Aortic SMC (Late) 26.56 21.91 4.64 40.11 HMVEC 24.34 19.6 4.74 37.55Human Umbilical Vein Endothelial 21.48 17.09 4.39 47.70 Cells (HUVEC)Confluent HUVEC IL 1 21.67 16.72 4.96 32.24 Adipose/MET 9 28.57 23.395.18 27.49 Artery/Normal/Carotid/CLN 595 28.98 19.27 9.71 1.19Artery/Normal/Carotid/CLN 598 29.8 20.16 9.63 1.26 Artery/normal/NDR 35227.94 20.06 7.88 4.25 Artery/Normal/Muscular/AMC 198 28.43 20.86 7.585.23 Artery/Normal/AMC 150 39.35 21.79 17.57 0.00 Artery/Normal/AMC 7338.26 24.69 13.57 0.00 Artery/Diseased/iliac/NDR 753 26.32 19.27 7.057.52 Artery/Diseased/Tibial/PIT 679 31.79 20.83 10.96 0.50Aorta/Diseased/PIT 732 30.81 22.68 8.13 3.57 Vein/Normal/Saphenous/AMC69 30.23 21.67 8.56 2.64 Vein/Normal/Saphenous/NDR 724 26.14 18.34 7.794.50 Vein/Normal/Saphenous/NDR 721 23.94 17.27 6.67 9.85Vein/Normal/SaphenousAMC 107 31.79 21.5 10.29 0.80 Vein/Normal/NDR 23931.07 21.17 9.89 1.05 Vein/Normal/Saphenous/NDR 237 28.27 19.79 8.482.80 Vein/Normal/NDR 235 31.23 22.81 8.43 2.91 Vein/Normal/MPI 1101 38.819.07 19.73 0.00 Vein/Diseased/Saphenous/AMC 70 25.61 19.02 6.59 10.34

[0241] TABLE 26 EXPRESSION OF RAT 17903 IN RAT TISSUES Tissue Mean HKMean δCt Expression Brain 26.12 14.99 11.14 0.22 Cortex 27.46 15.2012.26 0.10 Striatum 26.25 15.06 11.20 0.21 Thalamus 26.35 15.00 11.360.19 Cerebellum 26.04 15.18 10.87 0.26 Brain Stem 25.62 15.08 10.54 0.33Dorsal Nuclei 26.27 15.30 10.97 0.24 Spinal cord 25.31 15.05 10.26 0.40TRG 26.29 15.24 11.05 0.23 DRG 27.22 15.28 11.95 0.12 SCG 26.92 15.5011.42 0.18 Sciatic Nerve 25.03 15.25 9.78 0.55 Hairy Skin 26.19 15.5010.70 0.29 Gastro Muscle 25.12 15.47 9.65 0.60 Heart 24.74 15.29 9.450.70 Kidney 26.16 15.90 10.26 0.40 Liver 26.29 15.31 10.98 0.24 Lung25.03 15.19 9.84 0.53

[0242] TABLE 27 EXPRESSION OF RAT 17903 IN RAT TISSUES Tissue Mean 18SMean δCT Expression Na{dot over (i)}ve DRG 25.12 12.63 12.50 0.17 I DRGCCI 3 26.25 13.87 12.39 0.18 I DRG CCI 7 26.13 13.50 12.63 0.15 I DRGCCI 14 26.30 13.47 12.83 0.13 I DRG CCI 10 26.10 13.50 12.60 0.16 I DRGCCI 28 26.05 12.84 13.21 0.10 Na{dot over (i)}ve DRG 25.12 12.63 12.500.17 I DRG CFA 1 25.99 12.38 13.61 0.08 I DRG CFA 3 26.13 12.92 13.210.10 I DRG CFA 7 26.11 12.78 13.33 0.09 I DRG CFA 14 27.35 13.44 13.910.06 I DRG CFA 28 26.28 13.04 13.24 0.10 Na{dot over (i)}ve DRG 25.1212.63 12.50 0.17 I DRG AXT 1 25.75 12.19 13.56 0.08 I DRG AXT 3 26.0612.62 13.45 0.09 I DRG AXT 7 26.48 13.04 13.44 0.09 I DRG AXT 14 26.4212.43 13.99 0.06 I DRG AXT 28 26.15 13.99 12.16 0.21

[0243] TABLE 28 EXPRESSION OF RAT 17903 IN RAT TISSUES Tissue r17903 18SδCt Expression Na{dot over (i)}ve SC 26.73 13.97 12.76 0.11 I SC CCI 325.41 13.72 11.69 0.24 I SC CCI 7 25.19 14.04 11.15 0.34 I SC CCI 1425.03 13.68 11.35 0.30 Na{dot over (i)}ve SC 26.73 13.97 12.76 0.11 I SCCFA 3 27.01 13.39 13.62 0.06 I SC CFA 7 24.78 13.64 11.15 0.35 I SC CFA14 27.61 13.51 14.10 0.04 I SC CFA 28 25.61 13.62 11.99 0.19 Na{dot over(i)}ve SC 25.10 12.67 12.43 0.14 I SC AXT 1 24.79 12.58 12.21 0.16 I SCAXT 3 25.11 12.93 12.19 0.17 I SC AXT 7 25.49 13.14 12.35 0.15 I SC AXT14 25.20 12.40 12.80 0.11 I SC AXT 28 25.62 12.39 13.24 0.08

[0244] TABLE 29 EXPRESSION OF 17903 HK Relative Tissue Average AverageδCT Expression MK Cortex 23.08 21.375 1.705 0.17504337 MK DRG 23.4117.99 5.42 0.01332967 MK Spinal Chord 22.415 19.135 3.28 0.0587521 MKSciatic Nerve 21.305 17.85 3.455 0.0520407 MK Kidney 21.49 18.155 3.3350.05655445 MK hairy skin 21.02 18.95 2.07 0.13591573 MK heart LV 21.3417.965 3.375 0.05500796 MK gastro 21.225 19.165 2.06 0.13686109 muscleMK liver 22.175 18.48 3.695 0.04406522 MK gastro 21.34 19.21 2.130.13037908 muscle Human brain 21.475 19.33 2.145 0.12903052 Human spinal22.29 18.615 3.675 0.04468035 chord Human Kidney 21.32 18.165 3.1550.06406962 Human Liver 23.055 18.305 4.75 0.02120847 Human Lung 21.3116.12 5.19 0.0156335

[0245] Human 3700

[0246] The invention is based, at least in part, on the discovery of anovel protein kinase, herein referred to as “3700”. The human 3700 cDNAsequence (SEQ ID NO:43), which is approximately 3353 nucleotide residueslong including non-translated regions, contains a predictedmethionine-initiated coding sequence of about 1884 nucleotide residues,excluding termination codon (i.e., nucleotide residues 157-2040 of SEQID NO:43; also shown in SEQ ID NO:45). The coding sequence encodes a 628amino acid protein having the amino acid sequence SEQ ID NO:44.

[0247] Human 3700 contains the following regions or other structuralfeatures: a predicted pkinase domain (PF00069) at about amino acidresidues 53-303 of SEQ ID NO:44, a protein kinases ATP-binding regionsignature sequence at residues 59 to 67 of SEQ ID NO:44, and aserine/threonine protein kinase active site signature sequence atresidues 171 to 183 of SEQ ID NO:44. A transmembrane domain is predictedat about amino acid residues 234 to 250 of SEQ ID NO:44.

[0248] The human 3700 protein has predicted N-glycosylation sites (Pfamaccession number PS00001) at about amino acid residues 121-124 and576-579 of SEQ ID NO:44; predicted cAMP-/cGMP-dependent protein kinasephosphorylation sites (Pfam accession number PS00004) at about aminoacid residues 290-293, 337-340, and 413-416 of SEQ ID NO:44; predictedprotein kinase C phosphorylation sites (Pfam accession number PS00005)at about amino acid residues 30-32, 74-76, 82-84, 122-124, 142-144,148-150, 289-291, 327-329, 339-341, 373-375, 377-379, and 616-618 of SEQID NO:44; predicted casein kinase II phosphorylation sites (Pfamaccession number PS00006) located at about amino acid residues 15-18,133-136, 148-151, 227-230, 293-296, 331-334, 377-380, 391-394, 461-464,511-514, 523-526, 578-581, and 606-609 of SEQ ID NO:44; a predictedtyrosine kinase phosphorylation site at residues 453-460 of SEQ IDNO:44; predicted N-myristoylation sites (Pfam accession number PS00008)at about amino acid residues 320-325, 347-352, and 360-365 of SEQ IDNO:44; and a predicted cell attachment sequence at about amino acidresidues 134-136 of SEQ ID NO:44.

[0249] Polypeptides of the invention include fragments which include:all or part of a hydrophobic sequence, e.g., the sequence of aboutresidues 234-250 of SEQ ID NO:44; all or part of a hydrophilic sequence,e.g., the sequence of residues 40-55 or 445-470 of SEQ ID NO:44; asequence which includes a cysteine residue; or a glycosylation site.

[0250] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997, Protein 28:405-420).

[0251] The 3700 protein contains a significant number of structuralcharacteristics in common with members of the Protein Kinase family.Protein phosphorylation is influenced primarily by enzymes of two types,namely protein kinases (PKs) and protein phosphatases (PPs). PKscatalyze addition of a phosphate moiety to a protein amino acid residue(generally a serine, threonine, or tyrosine residue), and PPs catalyzeremoval of such moieties. The catalytic activities of PKs and PPs are,in turn, influenced by the state of the cell and the environment inwhich it finds itself. Phosphorylation of amino acid residues by a PKgenerally manifests itself in the form of faster cell growth,metabolism, or division, as greater motility, or in the form of highergene transcription, although certain physiological processes areinhibited by protein phosphorylation. De-phosphorylation of amino acidresidues by a PP, by contrast, generally manifests itself as slower (orhalted) cell growth, division, or metabolism, as lower motility, or inthe form of lower gene transcription. PK/PP-modulated proteinphosphorylation is also involved in carcinogenesis.

[0252] Without being bound by any particular theory of operation, 3700protein is believed to be a serine/threonine kinase.

[0253] A 3700 polypeptide can include a pkinase domain. As used herein,the term “pkinase domain” refers to a protein domain having an aminoacid sequence of about 200-300 amino acid residues in length,preferably, at least about 225-300 amino acids, more, preferably about278 amino acid residues or about 251 amino acid residues and has a bitscore for the alignment of the sequence to the pkinase domain (HMM) ofat least 100 or greater, preferably 200 or greater, and more preferably300 or greater. The pkinase domain has been assigned the PFAM accessionPF00069.

[0254] In a preferred embodiment, 3700 polypeptide or protein has apkinase domain or a region which includes at least about 200-300, morepreferably about 225-300, 278, or 251 amino acid residues and has atleast about 60%, 70%, 80%, 90%, 95%, 99%, or 100% homology with apkinase domain, e.g., the pkinase domain of human 3700 (e.g., residues53-303 of SEQ ID NO:44).

[0255] To identify the presence of a pkinase domain profile in a 3700receptor, the amino acid sequence of the protein is searched against adatabase of HMMs (e.g., the Pfam database, release 2.1) using thedefault parameters. For example, the hmmsf program, which is availableas part of the HMMER package of search programs, is a family specificdefault program for PF00069 and score of 100 is the default thresholdscore for determining a hit. For example, using ORFAnalyzer software, apkinase domain profile was identified in the amino acid sequence of SEQID NO:44 (e.g., amino acids 53-303 of SEQ ID NO:44). Accordingly, a 3700protein having at least about 60-70%, more preferably about 70-80%, orabout 80-90% homology with the pkinase domain profile of human 3700 iswithin the scope of the invention.

[0256] In one embodiment, a 3700 protein includes at least onetransmembrane domain. As used herein, the term “transmembrane domain”includes an amino acid sequence of about 5 amino acid residues in lengththat spans the plasma membrane. More preferably, a transmembrane domainincludes about at least 10, 15, 20 or 22 amino acid residues and spans amembrane. Transmembrane domains are rich in hydrophobic residues, andtypically have an alpha-helical structure. In a preferred embodiment, atleast 50%, 60%, 70%, 80%, 90%, or 95% or more of the amino acids of atransmembrane domain are hydrophobic, e.g., leucines, isoleucines,tyrosines, or tryptophans. Transmembrane domains are described in, forexample, Zagotta W. N. et al. (1996, Annu. Rev. Neurosci. 19: 235-263),the contents of which are incorporated herein by reference. Amino acidresidues 234 to about 250 of SEQ ID NO:44 comprise a transmembranedomain in a 3700 protein. In one embodiment, the amino-terminal domainof 3700 protein (i.e., about residues 1-233 of SEQ ID NO:44) is on thecytoplasmic side of a cellular membrane (e.g., the nuclear membrane orthe cytoplasmic membrane) and the carboxyl-terminal domain (i.e., aboutresidues 251-628 of SEQ ID NO:44) is on the non-cytoplasmic side of thesame membrane. In another embodiment, the amino-terminal domain isoriented on the non-cytoplasmic side of the membrane and thecarboxyl-terminal domain is oriented on the cytoplasmic side.

[0257] While not being bound by any particular theory of operation, 3700protein is believed to be, in at least one embodiment, a nuclearmembrane protein having its carboxyl-terminal domain oriented within thenuclear envelope. In this embodiment, 3700 protein is capable oftransmitting signaling information from the cytoplasm to the nucleus,whereby, for example, gene transcription can be regulated.

[0258] In one embodiment of the invention, a 3700 polypeptide includesat least one pkinase domain. In another embodiment, the 3700 polypeptideincludes at least one pkinase domain and at least one transmembranedomain. The 3700 molecules of the present invention can further includeone or more of the N-glycosylation, cAMP-/cGMP-dependent protein kinasephosphorylation, protein kinase C phosphorylation, casein kinase IIphosphorylation, tyrosine kinase phosphorylation, N-myristoylation, andcell attachment sites described herein, and preferably comprises most orall of them.

[0259] Because the 3700 polypeptides of the invention can modulate3700-mediated activities, they can be used to develop novel diagnosticand therapeutic agents for 3700-mediated or related disorders, asdescribed below.

[0260] As used herein, a “3700 activity,” “biological activity of 3700,”or “functional activity of 3700,” refers to an activity exerted by a3700 protein, polypeptide or nucleic acid molecule on, for example, a3700-responsive cell or on a 3700 substrate (e.g., a protein substrate)as determined in vivo or in vitro. In one embodiment, a 3700 activity isa direct activity, such as association with a 3700 target molecule. A“target molecule” or “binding partner” of a 3700 protein is a molecule(e.g., a protein or nucleic acid) with which the 3700 protein binds orinteracts in nature. In an exemplary embodiment, such a target moleculeis a 3700 receptor. A 3700 activity can also be an indirect activity,such as a cellular signaling activity mediated by interaction of the3700 protein with a 3700 receptor.

[0261] The 3700 molecules of the present invention are predicted to havesimilar biological activities as PK family members. For example, the3700 proteins of the present invention can have one or more of thefollowing activities: (1) catalyzing formation of a covalent bond withinor between an amino acid residue (e.g., a serine or threonine residue)and a phosphate moiety; (2) modulating cell signaling; (3) modulatingcell growth; (4) modulating cell differentiation; (5) modulatingtumorigenesis; (6) modulating entry of a cell into the cell cycle; (7)modulating progression of a cell through the cell cycle; (8) modulatingmitogenesis; (9) modulating cell motility; (10) modulating acell-to-cell interaction; (11) modulating cell metabolism; (12)modulating gene transcription; (13) modulating an immune response; (14)modulating angiogenesis; (15) modulating tissue (e.g., kidney or liver)repair or regeneration; (16) modulating establishment ofatherosclerosis; (17) modulating progression of atherosclerosis; and(18) modulating signaling across the blood-brain barrier.

[0262] Thus, 3700 molecules described herein can act as novel diagnostictargets and therapeutic agents for prognosticating, diagnosing,preventing, inhibiting, alleviating, or curing PK-related disorders.

[0263] Other activities, as described below, include the ability tomodulate function, survival, morphology, proliferation and/ordifferentiation of cells of tissues in which 3700 molecules areexpressed. Thus, the 3700 molecules can act as novel diagnostic targetsand therapeutic agents for controlling disorders involving aberrantactivities of these cells.

[0264] The 3700 molecules can also act as novel diagnostic targets andtherapeutic agents for controlling cellular proliferative and/ordifferentiative disorders (e.g., hematopoietic neoplastic disorders,carcinoma, sarcoma, metastatic disorders or hematopoietic neoplasticdisorders, e.g., leukemias. A metastatic tumor can arise from amultitude of primary tumor types, including but not limited to those ofprostate, colon, lung, breast and liver origin.

[0265] Expression data included herein indicate that 3700 is highlyexpressed in tissues having endothelial or epithelial cell layers, suchas in blood vessels, kidney, and pancreas. These data indicate that 3700protein can be involved in a variety of disorders that afflictendothelial and epithelial tissues. Examples of such disorders includecardiovascular disorders such as atherosclerosis, arteriosclerosis,abnormal blood coagulation, and coronary artery disease.

[0266] 3700 is expressed in aortic and coronary smooth muscle cells,indicating that 3700 can have a role in disorders that affect thesetissues. Examples of these disorders include coronary artery disease andcardiac insufficiency. 3700 can also be involved in the response ofaortic and coronary tissues to ischemic damage, such as that associatedwith cardiac infarction or thrombotic injury to coronary arteries.

[0267] Expression of 3700 is enhanced in the presence of inflammatorycytokines, indicating a role for 3700 in normal and aberrantinflammatory responses. 3700 can have a role in a variety of immunedisorders in tissues in which it is expressed. By way of example, 3700can have a role in prostatitis, pancreatitis, meningitis, severeallergic reactions, and in autoimmune disorders. Modulating the activityor expression of 3700 can affect the severity of the immune disorder.

[0268] Expression of 3700 increases with age in transgenic mice in whichthe apoE gene has been silenced. The apoE mouse is an accepted model ofatherosclerosis, and genes that are upregulated in that model often havea role in establishment or progression of atherosclerosis. Inflammatorycytokines are also known to enhance expression of genes (e.g., thoseencoding VCAM and E-selectin) that are associated with establishment andprogression of atherosclerosis. These observations indicate that 3700 isinvolved in atherosclerosis in humans, and the establishment andprogression of atherosclerosis in humans can be modulated by modulatingone or both of expression and activity of 3700. Expression of 3700appears to be enhanced earlier than other known inflammatory effectormolecules, indicating that inhibition of activity or expression of 3700may have a more beneficial effect than therapeutic methods involving theother known inflammatory effector molecules.

[0269] The significant expression of 3700 in kidney tissues indicates arole for 3700 in the normal and aberrant functions of kidney tissues.Various kidney disorders can be associated with aberrant activity orexpression of 3700. Examples of these kidney-related disorders in which3700 can have a role include pancreatitis, endocrine and exocrine tumorsof the pancreas, diabetes, pancreatic abscesses, pancreatic fibrocysticdisease, and pancreatic cholera.

[0270] Expression of 3700 activity in astrocytes indicates that 3700 canhave a significant role in modulating signaling between the blood andbrain/central nervous system compartments. Ability of 3700 to contactmolecules that are present in the bloodstream or in the cerebrospinalfluid and to modulate the phosphorylation state of a protein in responseto such contact permits passage of a signal from one compartment to theother without the necessity for passage of a large molecule between thecompartments. Regulation of 3700 expression by inflammatory cytokinesindicates that 3700 protein can interact with relatively small peptideeffectors which normally or aberrantly occur in blood or cerebrospinalfluid. Thus, modulation of 3700 activity or expression permits one toaffect passage of signals between the blood and brain compartments.

[0271] Expression of 3700 in arterial tissue indicates that 3700 canhave a role in formation of new blood vessels (angiogenesis), such asthat associated with establishment or reestablishment of blood supply toa tumor or a wounded tissue. Higher levels of 3700 expression weredetected in lung, colon, ovarian, and breast tumors than in thecorresponding normal tissues. These observations indicate that 3700 canenhance establishment and increase of blood supply to tumors and otherrapidly-growing tissues (e.g., traumatized arterial endothelium) andthat modulation of 3700 activity, expression, or both, can limitestablishment and increase of blood supply to such tissues.

[0272] 3700 was more highly expressed in diseased liver tissue (e.g.,liver tissue obtained from patients with fibrosed or HBV-infectedlivers) than in normal liver tissues. These observations indicate that3700 can modulate liver tissue repair and that 3700 can also serve as anindicator of liver tissue damage. Increased expression of 3700 indamaged or diseased liver tissue indicates that such tissues are betterable than non-damaged liver to react to the presence of inflammatorycytokines (e.g., inducing apoptosis of seriously damaged liver cells orincreased attraction of cells which induce regeneration or repair ofliver tissue) and that such tissues direct increased blood supply,relative to non-damaged liver tissues. These functions can be moregenerally applicable, meaning that increased expression of 3700 in cellsof a non-liver tissue can enhance blood supply to the tissue and canenhance repair or regeneration of the tissue.

[0273] Modulation of 3700 activity, expression, or both can be used toinhibit, prevent, alleviate, or cure the disorders discussed herein.Furthermore, assessment of the level of 3700 activity, expression, orboth, can be used to diagnose or prognosticate these disorders.

[0274] Without being bound by any particular theory of operation, it isbelieved that the ability of 3700 protein to phosphorylate proteins,combined with its transmembrane nature, indicates an ability of 3700protein to transmit signals from the external environment of the cell tothe interior of the cell. Protein phosphorylation (e.g., that associatedwith G-protein signaling) is known to be a method by which transcriptionof genes can be modulated in response to extracellular stimuli. 3700protein can bind molecules (e.g., inflammatory cytokines such as tumorgrowth factor beta or endothelial growth factor) in the extracellularmilieu, undergo a conformational or other change, and exhibit anintracellular protein kinase activity. The intracellularlyphosphorylated protein can phosphorylate another protein or affect theconformation or protein-binding-state of a nucleic acid. Thus, directlyor indirectly, 3700 can affect the likelihood or rate at which a gene istranscribed, thereby correlating occurrence of an intracellular geneproduct with the presence of an extracellular signaling molecule. In oneembodiment, the membrane in which 3700 protein is embedded is thenuclear membrane, and 3700 protein catalyzes a change in thephosphorylation state of a nuclear membrane protein or an intranuclearprotein in response to occurrence of a signaling molecule in thecytoplasm of the cell.

[0275] Identification and Characterization of Human 3700 cDNA

[0276] The human 3700 nucleotide sequence (SEQ ID NO:43), which isapproximately 3353 nucleotides in length including non-translatedregions, contains a predicted methionine-initiated coding sequence atabout nucleotide residues 157-2040. The coding sequence encodes a 628amino acid protein (SEQ ID NO:44).

[0277] Expression of the 3700 Gene

[0278] Tables 30-41 list the results of real time quantitative PCR(TAQMAN®) analyses of 3700 gene expression in selected cells andtissues. In the Tables, “M” means monkey. TABLE 30 Relative Tissue TypeExpression of 3700 Artery normal 0 Vein normal 0 Aortic smooth musclecells EARLY 1.76 Coronary smooth muscle cells 5.66 Static humanumbilical vein endothelial cells 0 Shear human umbilical veinendothelial cells 1.24 Heart normal 0 Heart - congestive heart failure 0Kidney 44.3 Skeletal Muscle 0 Adipose normal 0 Pancreas 10.7 primaryosteoblasts 0.60 Osteoclasts (diff) 0 Skin normal 0.25 Spinal cordnormal 0 Brain Cortex normal 0.32 Brain Hypothalamus normal 0.42 Nerve 0Dorsal Root Ganglion 0 Glial Cells (Astrocytes) 64.03 Glioblastoma 0.11Breast normal 0 Breast tumor 0.53 Ovary normal 0.12 Ovary Tumor 5.26Prostate Normal 0 Prostate Tumor 0 Prostate Epithelial Cells 41.1 Colonnormal 0.22 Colon Tumor 4.96 Lung normal 0 Lung tumor 0.70 Lung -chronic obstrucive pulmonary disorder 0.28 Colon - inflammatory boweldisorder 0 Liver normal 0.098 Liver fibrosis 0.104 Dermal Cells-fibroblasts 0.56 Spleen normal 1.01 Tonsil normal 1.30 Lymph node 0.66Small Intestine 0.15 Skin-Decubitus 0.56 Synovium 0 Bone marrowmononuclear cells 0.48 Activated peripheral blood mononuclear cells 0

[0279] TABLE 31 Relative Tissue Type Expression of 3700 Artery normal0.804 Vein normal 0.331 Aortic smooth muscle cells EARLY 8.73 Coronarysmooth muscle cells 20.9 Static human umbilical vein endothelial cells2.70 Shear human umbilical vein endothelial cells 3.41 Heart normal0.366 Heart - congestive heart failure 0.280 Kidney 31.1 Skeletal Muscle1.73 Adipose normal 0.279 Pancreas 14.9 primary osteoblasts 2.13Osteoclasts (diff) 0.459 Skin normal 6.66 Spinal cord normal 1.52 BrainCortex normal 4.32 Brain Hypothalamus normal 5.49 Nerve 3.45 Dorsal RootGanglion 2.56 Resting peripheral blood mononuclear cells 1.56Glioblastoma 1.32 Breast normal 0.745 Breast tumor 3.31 Ovary normal4.52 Ovary Tumor 51.7 Prostate Normal 2.46 Prostate Tumor 0.950Epithelial Cells (Prostate) 52.2 Colon normal 2.77 Colon Tumor 17.3 Lungnormal 0.614 Lung tumor 7.31 Lung - chronic obstrucive pulmonarydisorder 2.51 Colon - inflammatory bowel disorder 0.308 Liver normal2.56 Liver fibrosis 16.2 Dermal Cells- fibroblasts 2.09 Spleen normal7.09 Tonsil normal 2.87 Lymph node 5.05 Small intestine 2.39Skin-Decubitus 3.30 Synovium 0.475 Bone marrow mononuclear cells 1.31Activated peripheral blood mononuclear cells 0.063

[0280] TABLE 32 Relative Tissue Type Expression of 3700 PIT 400 NormalBreast 0.00 PIT 372 Normal Breast 0.00 CHT 558 Normal Breast 0.00 CLN168 Breast Tumor: IDC 0.00 MDA 304 Breast Tumor: MD-IDC 0.33 NDR 58Breast Tumor: IDC 1.19 NDR 05 Breast Tumor: IDC 0.04 CHT 562 BreastTumor: IDC 0.00 NDR 138 Breast Tumor ILC (LG) 32.7 CHT 1841 Lymph node(Breast metastasis) 0.00 PIT 58 Lung (Breast metastasis) 0.00 PIT 208Normal Ovary 60.2 CHT 620 Normal Ovary 145 CLN 03 Ovary Tumor 62.9 CLN17 Ovary Tumor 199 MDA 25 Ovary Tumor 141 MDA 216 Ovary Tumor 0.00 CLN012 Ovary Tumor 0.77 MDA 185 Normal Lung 11.3 CLN 930 Normal Lung 21.1MDA 183 Normal Lung 33.6 MPI 215 Lung Tumor - SmC 10.2 MDA 259 LungTumor - PDNSCCL 0.01 CHT 832 Lung Tumor - PDNSCCL 36.5 MDA 262 LungTumor - SCC 9.96 CHT 793 Lung Tumor - ACA 4.47 CHT 331 Lung Tumor - ACA50.1 CHT 405 Normal Colon 0.90 CHT 523 Normal Colon 1.78 CHT 371 NormalColon 0.01 CHT 382 Colon Tumor: MD 92.5 CHT 528 Colon Tumor: MD 90.9 CLN609 Colon Tumor 9.49 CHT 372 Colon Tumor: MD-PD 64.0 CHT 340 Colon-Livermetastasis 33.6 NDR 100 Colon-Liver metastasis 13.7 PIT 260 Normal Liver(female) 0.00 CHT 1653 Cervix Squamous CC 0.00 CHT 569 Cervix SquamousCC 0.51 A24 HMVEC-Arr 3.45 C48 HMVEC-Prol 0.00

[0281] TABLE 33 Relative 3700 Expression in Breast Tissues RelativeBreast Tissue Type Expression of 3700 MCF10MS 85.7 MCF10A 0.11MCF10AT.cl1 20.6 MCF10AT.cl3 30.5 MCF10AT1 14.9 MCF10AT3B 1.20MCF10CA1a.cl1 0.27 MCF10AT3B Agar 56.7 MCF10CA1a.cl1 Agar 2.91MCF10A.m25 Plastic 0.38 MCF10CA Agar 0.26 MCF10CA Plastic 1.43 MCF3BPlastic 3.73 MCF10A EGF 0 hr 0.25 MCF10A EGF 0.5 hr 0.19 MCF10A EGF 1 hr0.08 MCF10A EGF 2 hr 0.02 MCF10A EGF 4 hr 0.19 MCF10A EGF 8 hr 0.21MCF10A IGF1A 0 hr 1.14 MCF10A IGF1A 0.5 hr 0.45 MCF10A IGF1A 1 hr 0.55MCF10A IGF1A 3 hr 1.10 MCF10A IGF1A 24 hr 1.53 MCF10AT3B.cl5 Plastic2.51 MCF10AT3B.cl6 Plastic 1.86 MCFI0AT3B.cl3 Plastic 2.51 MCF10AT3B.cl1Plastic 3.64 MCF10AT3B.cl4 Plastic 0.37 MCF10AT3B.cl2 Plastic 2.08MCF10AT3B.cl5 Agar 14.8 MCF10AT3B.cl6 Agar 26.3 MCF-7 106 ZR-75 78.0T47D 28.2 MPA-231 14.9 MDA-435 3.68 SkBr3 24.5 Hs578Bst 6.68 Hs578T 0.81MCF3B Agar 3.83

[0282] TABLE 34 Blood Vessel Tissue Type Relative Expression of 3700Aortic SMC 0.32 HMVEC 0.00 Human Adipose 0.00 HumanArtery/Normal/Carotid 0.00 Human Artery/Normal/Carotid 0.00 HumanArtery/Normal/Muscular 0.00 Artery/Normal 0.00 Artery/Normal 0.00 HumanArtery/Diseased/iliac 0.00 Human Artery/Diseased/Tibial 0.00 HumanAorta/Diseased 0.00 Human Vein/Normal/Saphenous 0.00 HumanVein/Normal/Saphenous 0.00 Human Vein/Normal/Saphenous 0.00 HumanVein/Normal/Saphenous 0.00 Human Vein/Diseased/Saphenous 0.00 HumanVein/Normal/ 0.00 Human Vein/Normal/Saphenous 0.00 Human Vein/Normal/0.00 Vein/Normal 0.00 M/Artery/Normal/Coronary 0.00M/Artery/Normal/Coronary 0.00 M/Artery/Normal/Coronary 0.00M/Artery/Normal/Coronary 0.00 M/Vein/Normal 0.00

[0283] TABLE 35 Relative Tissue Type Expression of 3700 HumanArtery/normal/NDR 352 0.373 Human IM Artery/Normal/AMC 73 0 HumanMuscular Artery/Normal/AMC 236 0 Human Muscular Artery/Normal/AMC 247 0Human Aorta/Diseased/PIT 710 0.216 Human Aorta/Diseased/PIT 711 0.914Human Aorta/Diseased/PIT 712 0.169 Human Artery/Diseased/iliac/NDR 7530.038 Human Artery/Diseased/Tibial/PIT 679 0.395 M/Aorta/Normal/MPI 5430 M/Vein/Normal/MPI 536 0 M/CAR 1174/Artery/Diseased 128 M/CAR1175/Artery/Diseased 9254 M/PRI 2/Pancreas 7.60 M/MPI 88/Kidney/Normal15830 M/MPI 282/Kidney/Normal 13090

[0284] TABLE 36 Relative Tissue Type Expression of 3700 Aortic smoothmuscle cell 16.9 Coronary smooth muscle cell 50.4 Huvec Static 5.28Huvec LSS 24.1 Human Adipose/MET 9 0.511 Human Artery/Normal/Carotid/CLN595 1.28 Human Artery/Normal/Carotid/CLN 598 1.05 HumanArtery/normal/NDR 352 2.53 Human IM Artery/Normal/AMC 73 0 HumanMuscular Artery/Normal/AMC 236 0 Human Muscular Artery/Normal/AMC 247 0Human Muscular Artery/Normal/AMC 254/ 0 Human Muscular Artery/Normal/AMC259 0 Human Muscular Artery/Normal/AMC 261 0.874 Human MuscularArtery/Normal/AMC 275 0.871 Human Aorta/Diseased/PIT 732 4.27 HumanAorta/Diseased/PIT 710 0.607 Human Aorta/Diseased/PIT 711 0.442 HumanAorta/Diseased/PIT 712 0.665 Human Artery/Diseased/iliac/NDR 753 0.143Human Artery/Diseased/Tibial/PIT 679 1.15 Human Vein/Normal/SaphenousAMC107 0.152 Human Vein/Normal/NDR 239 0.717 HumanVein/Normal/Saphenous/NDR 237 0.638 Human Vein/Normal/PIT 1010 0.250Human Vein/Normal/AMC 191 1.25 Human Vein/Normal/AMC 130 0.614 HumanVein/Normal/AMC 188 0 HUVEC Vehicle 2.73 HUVEC Mev 1.60 HAEC Vehicle0.571 HAEC Mev 0.428

[0285] TABLE 37 Tissue Type Relative Expression of 3700 M/CAR1174/Artery/Diseased 0 M/CAR 1175/Artery/Diseased 0 M/PRI 2/Pancreas1.31 M/MPI 282/Kidney/Normal 0 M/MPI 282/Kidney/Normal 0 Human PIT289/Kidney/Normal 20.7 Human NDR 233/Kidney/HT 8.52 Human NDR224/Kidney/HT 19.2 Human NDR 248/Kidney/HT 26.1 Human MPI146/Liver/Normal 0.106

[0286] TABLE 38 Tissue Type Relative Expression of 3700 ONC 101Hemangioma 0 ONC 102 Hemangioma 0.07 ONC 103 Hemangioma 0 NDR 203 NormalKidney 120 PIT 213 Renal Cell Carcinoma 1.05 CHT 732 Wilms Tumor 2.93CHT 765 Wilms Tumor 9.04 NDR 295 Skin 3.71 CHT 1424 UterineAdenocarcinoma 0.25 CHT 1238 Neuroblastoma 0.04 BWH 78 Fetal Adrenal 0BWH 74 Fetal Kidney 26.5 BWH 4 Fetal Heart 0 MPI 849 Normal Heart 0 CLN746 Spinal cord 0.58 CHT 1273 Glioblastoma 0.27 CHT 216 Glioblastoma0.64 CHT 501 Glioblastoma 4.69

[0287] TABLE 39 Tissue Type Relative Expression of 3700 Conf HMVEC 0.000Aortic SMC 0.211 Human Fetal Heart 0.000 Human Heart Normal Atrium 0.000Human Heart Normal Atrium 0.000 Human Heart Normal Ventricle 0.000 HumanHeart Normal Ventricle 0.000 Human Heart Normal Ventricle 0.000 HumanHeart Normal Ventricle 0.000 Human Heart Normal Ventricle 0.000 HumanHeart Diseased Ventricle 0.000 Human Heart Diseased Ventricle 0.000Human Heart Diseased Ventricle 0.002 Human Kidney normal 9.62 HumanKidney normal 32.0 Human Kidney normal 7.52 Human Kidney normal 4.55Human Kidney normal 2.03 Human Kidney HT 5.64 Human Kidney HT 9.89 HumanKidney HT 12.9 Human Kidney HT 8.32 Human Skeletal Muscle 0.000 HumanSkeletal Muscle 0.001 Human Liver 0.000 Human Liver 0.000 Fetal AdrenalNormal 0.000 Wilms Tumor 0.793 Wilms Tumor 0.262 Spinal Cord Normal0.006 Cartilage Diseased 0.016 M Heart Normal Atrium 0.001 M HeartNormal Atrium 0.002 M Heart Normal Ventricle 0.002 M Heart NormalVentricle 0.009

[0288] TABLE 40 Liver Tissue Type Relative Expression of 3700 Liver NDR200 20 Liver CHT 339 25 Liver Pit 260 12 MAI 01 14 MAI 10 18 Hep C+ 51826 Hep C+ 519 54 HepG2 174 HepG2.2.15 1120 HBV-X Trans con #17 202 HBV-XTrans #18 426 NT2/KOS 0 hr. 3340 NT2/KOS 2.5 hr. 5940 NT2/KOS 5 hr. 4760NT2/KOS 7 hr. 7160

[0289] TABLE 41 Tissue Type Relative Expression of 3700 M/CAR1174/Artery/Diseased 1.62 M/CAR 1175/Artery/Diseased 0.11 M/PRI2/Pancreas 44.5 M/MPI 88/Kidney/Normal 87.8 M/MPI 282/Kidney/Normal 184Human/PIT 289/Kidney/Normal 1110 Human/NDR 233/Kidney/HT 79.7 Human/NDR224/Kidney/HT 151 Human/NDR 248/Kidney/HT 209 Human/MPI 146/Liver/Normal4.20

[0290] Human 21529

[0291] The present invention is based, at least in part, on theidentification of novel molecules, referred to herein as “21529”, alsoknown as adenylate cyclase nucleic acid and polypeptide molecules, whichplay a key role in regulation of the cyclic AMP (cAMP) signaltransduction pathway by virtue of their conversion of intracellular ATPinto cAMP. In one embodiment, the adenylate cyclase molecules modulatethe activity of one or more proteins involved in cellular metabolismassociated with cell maintenance, growth, or differentiation, e.g.,cardiac, epithelial, or neuronal cell maintenance, growth, ordifferentiation. In another embodiment, the adenylate cyclase moleculesof the present invention are capable of modulating the phosphorylationstate of one or more proteins involved in cellular metabolism associatedwith cell maintenance, growth, or differentiation, e.g., cardiac,epithelial, or neuronal cell maintenance, growth or differentiation, viatheir indirect effect on cAMP-dependent protein kinases, particularlyprotein kinase A, as described in, for example, Devlin (1997) Textbookof Biochemistry with Clinical Correlations (Wiley-Liss, Inc., New York,N.Y.). In addition, the receptors which trigger activity of theadenylate cyclases of the present invention are targets of drugs asdescribed in Goodman and Gilman (1996), The Pharmacological Basis ofTherapeutics (9^(th) ed.) Hartman & Limbard Editors, the contents ofwhich are incorporated herein by reference. Particularly, the adenylatecyclase molecules of the invention may modulate phosphorylation activityin tissues in which the polypeptides are highly expressed, including butnot limited to skeletal muscle, heart, cervix, vein, brain, pancreas,breast, fetal kidney, fetal liver, and fetal heart.

[0292] Furthermore, 21529 expression may be modulated in tissues inwhich the 21529 polypeptides are expressed including, but not limitedto, skeletal muscle, heart, cervix, vein, brain, pancreas, breast, fetalkidney, fetal liver, and fetal heart, which provides a profile ofexpression in normal human tissues. In addition, upregulation isobserved in breast carcinoma. Therefore, modulation is particularlyrelevant in this disorder. Further, 21529 downregulation is shown inboth lung and colon carcinoma. Therefore, modulation is also relevant inthese tissues. In colonic liver metastases, however, there issignificant upregulation. Accordingly, modulation is important in thesetissues. Furthermore, 21529 expression occurs in cardiovascular tissues,such as, but are not limited to, aorta, aorta with intimal proliferation(atheroplaques), coronary artery, internal mammary artery, heart,especially heart derived from patients with congestive heart failure andheart tissue derived from myopathic patients, ischemic heart, andsaphenous vein, (the chief superficial vein found in the human leg).Finally, as further discussed herein, the 21529 gene is expressed inhypertrophic cardiac myocytes from diseased subjects. Accordingly, 21529modulation is particularly relevant in disorders that include but arenot limited to congestive heart failure, ischemia, hypertension,myocardial infarction, atherosclerosis, cardiomyopathy, and otherdiseases of the cardiovascular system as disclosed herein.

[0293] In a preferred embodiment, the adenylate cyclase molecules of theinvention are used to modulate the cyclic AMP (cAMP) signal transductionpathway. Cyclic AMP is a second messenger produced in response toligand-induced stimulation of certain G-protein-coupled receptors(GPCR). In the cAMP signal transduction pathway, binding of a ligand toa GPCR leads to the activation of adenylate cyclase, which thencatalyzes the synthesis of cAMP. The newly synthesized cAMP can in turnactivate a cAMP-dependent protein kinase, such as protein kinase A. Theactivated cAMP-dependent kinases can, through a series of intermediatesteps, regulate transcription factors and stimulate expression of targetgenes, as well as phosphorylate other downstream target proteins thatare involved in a host of metabolic pathways. In addition, activatedcAMP-dependent protein kinases can phosphorylate a voltage-gatedpotassium channel protein and lead to the inability of the potassiumchannel to open during an action potential. The inability of thepotassium channel to open results in a decrease in the outward flow ofpotassium, which normally repolarizes the membrane of a neuron, leadingto prolonged membrane depolarization.

[0294] Cyclic AMP also influences cardiovascular physiology. Forinstance, cAMP activates protein kinase A (PKA). The activated subunitsof PKA initiate a series of enzymatic reactions that ultimately activatemultiple proteins that regulate both the rate and force of cardiaccontraction. For instance, phosphorylation of the L-type calcium channelenhances calcium entry into cardiocytes leading to increasedcontractility. Upon phosphorylation of phospholamban, the inhibitionexerted by the non phosphorylated form of phospholamban on thesarcoplasmic reticulium calcium pump is removed, and its rate of calciumuptake increased, thereby leading to a more rapid decrease of thecytosolic calcium concentration during diastole. Dissociation of thetroponin C-calcium complex is also enhanced when troponin I isphosphorylated which leads to an accelerated relaxation rate. Suchevents result in the enhancement of cardiac output. This process rapidlyreverses when agonist occupancy of the receptor ceases, i.e. thereuptake of norepinephrine into presynaptic stores. For a review, seefor example, Yoshihiro et al. (1997) Circulation Research 80:297-304 andCastellano et al. (1997) Hypertension 29:715-722.

[0295] As the enzyme that catalyzes conversion of intracellular ATP tocAMP, adenylate cyclase plays a central role in the regulation ofcellular cAMP concentrations. Disruption or modulation of adenylatecyclase activity affects intracellular concentrations of cAMP, which canin turn modulate the cAMP signal transduction pathway.

[0296] Many cardiovascular patho-physiological conditions result frommodulations in the cAMP signaling pathway. Therefore, changes inconcentration and function of receptors, G-proteins, and adenylatecyclase may thus constitute fundamental defects underlying certaincardiac diseases.

[0297] Alterations that accompany physiological changes incardiovascular function include, for example, transformations of themyocardial structure and function such as a transition of the myosinheavy chain isoform (Imumo et al. (1987) J Clin Invest 79:970977),accumulation of alpha-skeletal muscle actin mRNA (Schwartz et al. (1986)Circ Res 59:551-555) changes in troponin isoforms (Mayer et al. (1995)Curr Opin Cardiol 10:238-245) deterioration of Na+K+-ATPases(Charlemagne et al. (1986) J Biol Chem 261:185-189) and collagenremodeling of myocardium (Wever et al. (1988) Circ Res 62:757-763).Further changes in physiological cardiovascular function resulting fromvarious forms of heart failure include alterations in arterial tone andreactivity and alterations in platelet function including aggregation,secretion, and clot formation and blood pressure elevation. (Marcil etal. (1996) Hypertension 28:83-90).

[0298] Adenylate cyclase has been implicated in many cardiovasculardiseases. For example, adenylate cyclase activity and its responsivenessto various hormones is altered in hypertensive patients. Aberrantadenylate cyclase levels in hypertensive patients were restored towardnormal following antihypertensive drug therapy (Marcil et al. (1996)Hypertension 28:83-90). In addition, studies of heart in human andanimal models indicate adenylate cyclase has function in cardiomyopathy(Michael et al. (1995) Hypertension 25:962-970, Roth et al (1999)Circulation 99:3099-3099), ischemia (Sandhu et al. (1996) CirculationResearch 78:137-147), myocardial infarction (Espinasse et al. (1999)Cardiovascular Research 42:87-98) and congestive heart failure (Kawahiraet al. (1998) Circulation 98:262-267, Panza et al. (1995) Circulation91:1732-1738). Additionally, studies have indicated that adenylatecyclase has function in clinical situations resulting in myocardialdysfunction such as cardiopulmonary bypass (Booth et al. (1998)Anesthesiology 89: 602-611). Decreased concentrations of adenylatecyclase also occur in chronic pacing-induced heart failure (Ishikawa etal. (1994) J Clin Invest 93:2224-9), whereas changes in activity ofadenylate cyclase isoforms occur with activation of PKC (Kawabe et al.(1994) J Biol Chem 169: 16554-8), PKA (Chen et al. (1997) PNAS 94:14100-4), aging and in pressure-overload failing right ventricles(Bristow et al. (1992) J Clin Invest 89:803-15).

[0299] As the enzyme that catalyzes conversion of intracellular ATP tocAMP, adenylate cyclase plays a central role in the regulation ofcellular cAMP concentrations. Disruption or modulation of adenylatecyclase activity affects intracellular concentrations of cAMP, which canin turn modulate the cAMP signal transduction pathway. Modulation ofthis pathway can disrupt or alter cellular metabolism, growth, anddifferentiation, potentially leading to cellular growthrelated-disorders. As used herein, a “cellular growth-related disorder”includes a disorder, disease, or condition characterized by aderegulation, e.g., an upregulation or a downregulation, of cellulargrowth. Cellular growth deregulation may be due to a deregulation ofcellular proliferation, cell cycle progression, cellular differentiationand/or cellular hypertrophy. Examples of cellular growth relateddisorders include cardiovascular disorders such as heart failure,hypertension, atrial fibrillation, dilated cardiomyopathy, idiopathiccardiomyopathy, or angina; proliferative disorders or differentiativedisorders such as cancer, e.g., melanoma, prostate cancer, cervicalcancer, breast cancer, colon cancer, or sarcoma. Disorders associatedwith the tissues in which 21529 is expressed are also encompassed,especially skeletal muscle, heart, aorta, cervix, vein, brain, pancreas,and fetal kidney. Other disorders include tumors of the breast, lung,and colon. Disorders that are particularly relevant with respect toexpression of the adenylate cyclase are cardiovascular disorders. Asdescribed above, the 21529 adenylate cyclase is expressed in humancardiovascular tissues. Further, the 21529 gene is highly expressed inhypertrophic cardiac myocytes. Accordingly, disorders that are relevantinclude hypertension, atherosclerosis, ischemia, cardiomyopathy,congestive heart failure, myocardial infarction, and diseases of thecardiovascular system as disclosed herein.

[0300] The disclosed invention relates to methods and compositions forthe modulation, diagnosis, and treatment of adenylate cyclase-associatedor related disorders, particularly disorders resulting from aberrationsin components of the cAMP signal transduction pathway, such ascAMP-dependent disorders, and disorders associated with cAMP-dependentprotein kinases. Such disorders include, but are not limited to,disorders involving the skeletal muscle, heart, cervix, blood vessels,brain, pancreas, and cardiovascular system. Further relevant disordersinclude disorders involving the breast, and especially tumors of thebreast.

[0301] Specifically, the present invention provides isolated nucleicacid molecules comprising nucleotide sequences encoding the 21529adenylate cyclase polypeptide whose amino acid sequence is given in SEQID NO:47, or a variant or fragment of the polypeptide. A nucleotidesequence encoding an adenylate cyclase polypeptide of the invention,more particularly the polypeptide of SEQ ID NO:47, is set forth in SEQID NO:46 and 48.

[0302] A novel human gene, termed clone 21529 is provided. Thissequence, and complements thereof, are referred to as “adenylatecyclase” sequences indicating that the gene sequences share sequencesimilarity to adenylate cyclase genes.

[0303] The novel 21529 adenylate cyclase gene encodes an approximately3.52 Kb mRNA transcript having the corresponding cDNA set forth in SEQID NO:46. This transcript has a 3231 nucleotide open reading frame(nucleotides 247-3477 of SEQ ID NO:46; nucleotides 1-3231 of SEQ IDNO:48), which encodes a 1077 amino acid protein (SEQ ID NO:47). Ananalysis of the full-length 21529 polypeptide predicts that theN-terminal 50 amino acids may represent a region comprising a signalpeptide. MEMSAT program analysis of the full-length 21529 polypeptidepredicted transmembrane segments at amino acid residues (aa) 27-50,61-79, 92-113, 120-136, 143-160, 174-190, 365-381, 408-424, 589-605,612-631, 664-685, 713-736, 744-760, and 790-807 of SEQ ID NO:47.Transmembrane segments for the presumed mature peptide (aa 51-1077) werepredicted at aa 11-29, 42-63, 70-86, 93-110, 124-140, 315-331, 358-374,539-555, 562-581, 614-635, 663-686, 694-710, and 740-757 of SEQ IDNO:47. Prosite program analysis was used to predict various sites withinthe 21529 protein. N-glycosylation sites were predicted at aa 697-700,704-707, 836-839, and 938-941 of SEQ ID NO:47, with the actual modifiedresidue being the first amino acid. Protein kinase C phosphorylationsites were predicted at aa 6-8, 51-53, 202-204, 212-214, 218-220,290-292, 526-528, 550-552, and 606-608 of SEQ ID NO:47, with the actualmodified residue being the first amino acid. Casein kinase IIphosphorylation sites were predicted at aa 51-54, 115-118, 202-205,253-256, 290-293, 333-336, 359-362, 465-468, 495-498, 687-690, 878-881,919-922, 941-944, 958-961, 968-971, and 1015-1018 of SEQ ID NO:47, withthe actual modified residue being the first amino acid. Tyrosine kinasephosphorylation sites were predicted at aa 318-325, 437-444, 570-576,and 859-865 of SEQ ID NO:47, with the actual modified residue being thelast amino acid. N-myristoylation sites were predicted at aa 35-40,111-116, 137-142, 145-150, 184-189, 329-334, 345-350; 360-365, 368-373,402-407, 412-417, 458-463, 654-659, 661-666, 936-941, 995-1000,1008-1013, and 1055-1060 of SEQ ID NO:47, with the actual modifiedresidue being the first amino acid. A prokaryotic membrane lipoproteinlipid attachment site was predicted at aa 745-755 of SEQ ID NO:47, and aleucine zipper pattern was predicted at aa 55-76 of SEQ ID NO:47.Guanylate cyclase signature sequences were predicted at aa 377-400 and995-1018 of SEQ ID NO:47.

[0304] The 21529 adenylate cyclase protein possesses twoadenylate/guanylate cyclase catalytic domains, from aa 264-448 and aa864-1064 of SEQ ID NO:47, as predicted by HMMer, Version 2. Other domainmatches predicted by HMMer included a copper/zinc superoxide dismutasedomain, from aa 376-383 of SEQ ID NO:47, and a eubacterial secY proteindomain, from aa 60-385 of SEQ ID NO:47.

[0305] The 21529 protein displays closest similarity to the ratadenylate cyclase IV (CYA4) (SP Accession Number P26770), approximately86% identity over their 1075 amino acid overlap.

[0306] A plasmid containing the 21529 cDNA insert was deposited with thePatent Depository of the American Type Culture Collection (ATCC), 10801University Boulevard, Manassas, Va., on Apr. 6, 2000, and assignedPatent Deposit Number PTA-1661. This deposit will be maintained underthe terms of the Budapest Treaty on the International Recognition of theDeposit of Microorganisms for the Purposes of Patent Procedure. Thisdeposit was made merely as a convenience for those of skill in the artand is not an admission that a deposit is required under 35 U.S.C. §112.

[0307] The 21529 adenylate cyclase sequences of the invention aremembers of a family of molecules having conserved functional features.The term “family” when referring to the proteins and nucleic acidmolecules of the invention is intended to mean two or more proteins ornucleic acid molecules having sufficient amino acid or nucleotidesequence identity as defined herein. Such family members can benaturally occurring and can be from either the same or differentspecies. For example, a family can contain a first protein of murineorigin and a homolog of that protein of human origin, as well as asecond, distinct protein of human origin and a murine homolog of thatprotein. Members of a family may also have common functionalcharacteristics.

[0308] Another embodiment of the invention features isolated adenylatecyclase proteins and polypeptides having an adenylate cyclase proteinactivity. As used interchangeably herein, a “adenylate cyclase proteinactivity”, “biological activity of an adenylate cyclase protein”, or“functional activity of an adenylate cyclase protein” refers to anactivity exerted by an adenylate cyclase protein, polypeptide, ornucleic acid molecule on an adenylate cyclase responsive cell asdetermined in vivo, or in vitro, according to standard assay techniques.An adenylate cyclase activity can be a direct activity, such asconversion of intracellular ATP to cAMP, or an indirect activity, suchas a cellular activity mediated by generation of cAMP, such as anydownstream cellular response associated with the cAMP signaltransduction pathway. In a preferred embodiment, a 21529 adenylatecyclase activity includes at least one or more of the followingactivities: (1) modulating (stimulating and/or enhancing or inhibiting)cellular growth, differentiation, and/or function, particularly in cellsin which the sequences are expressed, for example, cells of the skeletalmuscle, heart, cervix, vein, brain, pancreas, fetal kidney, and breasttumors, and cardiovascular tissue; a protein kinase A cellular effect,such as release of hormones, glycogen metabolism, such as in liver,heart, and skeletal muscles; (2) modulating the cAMP signal transductionpathway; (3) modulating a target cell's cAMP concentration; (4)modulating cAMP-dependent protein kinase activity, such as proteinkinase A; and (5) modulating the release of hormones, such as release ofcortisol in the adrenal gland cells, thyroid hormones from the thyroidgland, testosterone from testicular Leydig cells, and melatonin from thepineal gland.

[0309] Isolation of 21529

[0310] Clone 21529 was isolated from a human spleen or heart cDNAlibrary. The identified clone 21529 encodes a transcript ofapproximately 3.52 Kb (corresponding cDNA set forth in SEQ ID NO:46).The open reading frame (nucleotides 247-3477 of SEQ ID NO:46;nucleotides 1-3231 of SEQ ID NO:48) of this transcript encodes apredicted 1077 amino acid protein (SEQ ID NO:47). This novel gene ispreliminarily mapped to human chromosome 14 using the mapping panelGenebridge 4 human RH.

[0311] A search of the nucleotide and protein databases revealed that21529 encodes a polypeptide that shares similarity with severaladenylate cyclases, the greatest similarity being seen with the ratadenylate cyclase type IV protein (SP Accession Number P26770). Analignment of the 21529 polypeptide with this rat protein, using theClustal method with PAM250 residue weight table, demonstrates theoverall close similarity between the two sequences and indicates that21529 is the human ortholog of the rat adenylate cyclase type IV.

[0312] mRNA Expression of Clone 21529

[0313] Expression of the novel 21529 adenylate cyclase was measured byTaqMan quantitative PCR (Perkin Elmer Applied Biosystems) in cDNAprepared from the following normal human tissues: thymus, skeletalmuscle, liver, lung, thyroid, heart, ovary, aorta, placenta, cervix,lymph node, vein, brain, esophagus, pancreas, kidney, brain, prostate,liver, spleen, breast, colon, tonsil, small intestine, fetal kidney,fetal liver, fetal heart, and testis.

[0314] Probes were designed by PrimerExpress software (PE Biosystems)based on the 21529 sequence. The primers and probes for expressionanalysis of 21529 and β-2 microglobulin were as follows: 21529 ForwardPrimer AGCTGTGGCCCAGTTAATGG 21529 Reverse Primer CTTTGGCCCCTTCCAGGTT21529 TaqMan Probe CTACCGACTGGCGGTCATTGCCAG β-2 microglobulin ForwardPrimer CACCCCCACTGAAAAAGATGA β-2 microglobulin Reverse PrimerCTTAACTATCTTGGGCTGTGACAAAG β-2 microglobulin TaqMan ProbeTATGCCTGCCGTGTGAACCACGTG

[0315] The 21529 sequence probe was labeled using FAM(6-carboxyfluorescein), and the β2-microglobulin reference probe waslabeled with a different fluorescent dye, VIC. The differential labelingof the target adenylate cyclase sequence and internal reference genethus enabled measurement in the same well. Forward and reverse primersand the probes for both β2-microglobulin and the target 21529 sequencewere added to the TaqMan® Universal PCR Master Mix (PE AppliedBiosystems). Although the final concentration of primer and probe couldvary, each was internally consistent within a given experiment. Atypical experiment contained 200 nM of forward and reverse primers plus100 nM probe for β-2 microglobulin and 600 nM forward and reverseprimers plus 200 nM probe for the target 21529 sequence. TaqMan matrixexperiments were carried out on an ABI PRISM 7700 Sequence DetectionSystem (PE Applied Biosystems). The thermal cycler conditions were asfollows: hold for 2 min at 50° C. and 10 min at 95° C., followed bytwo-step PCR for 40 cycles of 95° C. for 15 sec followed by 60° C. for 1min.

[0316] The following method was used to quantitatively calculate 21529expression in the various tissues relative to β-2 microglobulinexpression in the same tissue. The threshold cycle (Ct) value is definedas the cycle at which a statistically significant increase influorescence is detected. A lower Ct value is indicative of a highermRNA concentration. The Ct value of the 21529 sequence is normalized bysubtracting the Ct value of the β-2 microglobulin gene to obtain a_(Δ)Ct value using the following formula:_(Δ)Ct=Ct_(h21529)−Ct_(β-2 microglobulin). Expression is then calibratedagainst a cDNA sample showing a comparatively low level of expression ofthe 21529 sequence. The _(Δ)Ct value for the calibrator sample is thensubtracted from _(Δ)Ct for each tissue sample according to the followingformula: _(ΔΔ)Ct=_(Δ)Ct−_(sample)−_(Δ)Ct−_(calibrator). Relativeexpression is then calculated using the arithmetic formula given by2^(−ΔΔCt). Expression of the target 21529 sequence in each of thetissues tested was then analyzed as discussed in more detail below.

[0317] The mRNA for the putative adenylate cyclase 21529 isdifferentially expressed in all of the normal tissues tested. There wassignificant expression in pancreas, vein, brain, heart, and skeletalmuscle; moderate expression in cervix, fetal kidney, fetal heart, liver,placenta, thyroid, ovary, breast, aorta, and brain; and lower expressionin lymph node, esophagus, kidney, lung, spleen, testis, small intestine,fetal liver, colon, prostate, thymus, and tonsil. These data indicatethis novel adenylate cyclase has a widely dispersed pattern ofexpression, a characteristic in common with the rat adenylate cyclase IVhomolog.

[0318] TaqMan data obtained using an Oncology panel wherein normalbreast, normal lung, normal colon and normal liver tissue samples werecompared to breast tumor, lung tumor, colon tumor and liver tumorsamples, respectively, demonstrated that 21529 was upregulated in breastand colon tumor samples compared to their respective normal tissuesamples.

[0319] mRNA Expression of Clone 21529 in Human Cardiovascular Tissues

[0320] mRNA was hybridized as discussed above in the followingcardiovascular tissues: aorta, aorta with intimal proliferation,coronary artery, mammary internal artery, heart, congestive heartfailure heart samples, ischemic heart samples, myopathic heart samples,and saphenous vein. These were compared in terms of relative expressionto the expression of the gene in skeletal muscle. Highest expression wasobserved in tissue from congestive heart failure patients and myopathichearts. Significant expression was also observed in coronary artery andin the internal mammary artery. Further, significant expression was alsoobserved in ischemic heart. Lower levels of expression were observed inthe remainder of the tissues.

[0321] Further, in situ hybridization experiments were done againsthypertrophic cardiac myocytes from diseased hearts. Results showedincreased expression of the gene in the hypertrophic myocytes.

[0322] Human 26176

[0323] The present invention is based, at least in part, on thediscovery of a novel calpain protease referred to herein as “26176”. Thepresent invention provides isolated nucleic acid molecules comprisingnucleotide sequences encoding the 26176 calpain protease polypeptidewhose amino acid sequence is given in SEQ ID NO:50, or a variant orfragment of the polypeptide. A nucleotide sequence encoding the 26176calpain protease polypeptides of the invention is set forth in SEQ IDNO:49. The sequences are members of the calpain family of thiolproteases, also referred to as the peptidase family C2.

[0324] Calpains refer to calcium-activated neutral proteinases, asuperfamily of endopeptidases typically having cysteine-proteinase andcalcium-binding characteristics. These proteinases cleave numeroussubstrate proteins in a limited manner, typically leading tomodification of the function and/or activity rather than generaldegradation of the substrate.

[0325] Calpains are classified into two main groups, the typical orconventional calpains and the a typical calpains, based on their domaincontent and/or variation. The typical calpains are further subdividedinto ubiquitous and tissue-specific calpains based on their predominatepatterns of expression.

[0326] Two forms of ubiquitous calpains have been extensivelycharacterized in vertebrates: the μ-calpains (calpain I, CAPN1) and them-calpains (calpain II, CAPN2), which are activated in vitro by micro-and millimolar calcium concentrations, respectively. An intermediate μ/mcalpain has been characterized in chicken.

[0327] The ubiquitous μ- and m-calpains are heterodimers, each having adistinct, but homologous, large 80 kDa subunit (referred to as μCL ormCL, respectively) and an identical small 30 kDa subunit (referred to as30K or Cs). The large subunit has four domains, designated I-IV from theN-terminus to the C-terminus. The function of domain I is unclear.Domain II is the cysteine protease domain responsible for calpainprotease activity. Domain III is homologous to a calmodulin-bindingprotein and is speculated to interact with the calcium-binding domainsof the large (domain IV) and small subunits (domain VI), when calcium isbound, thereby freeing the protease domain for activity (Goll et al.(1992) BioEssays 14:549-556). Domain IV of the large subunit is acalmodulin-like calcium binding domain containing four EF-handcalcium-binding motifs. Although structurally similar to calmodulin,domain IV is more similar to sorcin, ALG-2, and grancalcin. Sorcin isinvolved in the multi-drug resistance of cultured cell lines and wasrecently reported to associate with the cardiac ryanodine receptor.Grancalcin possibly plays a role in granule-membrane fusion anddegranulation. ALG-2 is thought to be involved in apoptosis and isinduced by tumor promoters. See Meyers et al. (1995) J. Biol. Chem.270:26411-26418; Meyers et al. (1985) J. Cell Biol. 100:588-597; Vito etal. (1996) Science 271:521-525; Teahan et al. (1992) Biochem. J.286:549-554; Boyhan et al. (1992) J. Biol. Chem. 267:2928-2933.

[0328] The small subunit of typical calpains contains two domains, whichare designated V and VI from the N-terminus to the C-terminus. Domain Vis an N-terminal glycine-clustering hydrophobic region. Domain VI, whichis similar to domain IV of the large subunit, is also a calcium-bindingdomain containing six EF-hands, EF2-EF5 as in the large subunit, and EF1and EF6. EF5 of domain VI does not bind calcium and is proposed to beinvolved in the heterodimeric binding of domains IV and VI duringinteraction between the large and small subunits.

[0329] Calpastatin is an endogenous inhibitor of most calpains, thetissue-specific calpain p94 being an exception. Calpastatin, which hasfive domains, is cleaved by calpain in the interdomain regions,generating inhibitory peptides. The inhibitory effect of calpastatin hasbeen attributed to interactions with calpain domains II, III, IV, andVI. The reactive site of calpastatin shows no apparent homology to thatof other protease inhibitors, and it contains the consensus sequenceTIPPXYR (SEQ ID NO:52), which is essential for inhibition. See Kawasakiet al. (1989) J. Biochem. 106:274-281; Croall et al. (1994) Biochem.33:13223-13230; Croall et al. (1991) Physiol. Rev. 71:813-847; Kawasakiet al. (1996) Mol. Membr. Biol. 13:217-224; Melloni et al. (1989) TrendsNeurosci. 12:438-444; Sorimachi et al. (1997) J. Biochem. 328:721-732;and Johnson et al. (1997) BioEssays 19(11):1011-1018.

[0330] Several typical tissue-specific calpains are known invertebrates, including skeletal muscle p94 (nCL-1, calpain 3′, CAPN3),stomach nCL2 (CAPN4) and nCL 2′, and digestive tubule nCL4. While p94contains EF hands, it does not require calcium for proteinase activity.p94 has a domain IV sequence similar to that of μCL and mCL, but it doesnot bind to a small 30 kDa subunit (Kinbara et al. (1997) Arch. Biochem.Biophys. 342:99-107). p94 contains unique insertion sequences called IS1and IS2, which are found in domain II and between domains III and IV,respectively). IS2 contains a nuclearlocalization-signal-like basicsequence (Arg-Pro-Xaa-Lys-Lys-Lys-Lys-Xaa-Lys-Pro (SEQ ID NO:53)).Connectin/titin binding is also attributed to IS2. p94 may change itslocalization in a cell-cycle dependent manner and may be involved inmuscle differentiation by interacting with the MyoD family. In fact, adefect in the protease p94 is responsible for limb-girdle musculardystrophy type 2A (LGMD2A). See Sorimachi et al. (1995) J. Biol. Chem.270:31158-31162; Sorimachi et al. (1993) J. Biol. Chem. 268:10593-10605;Gregoriou et al. (1994) Eur. J. Biochem. 223:455-464; and Belcastro etal. (1998) Mol. Cell. Biochem. 179 (1, 2):135-145.

[0331] Calpains have broad physiological and pathological roles relatedto the enzymes' diverse population of substrates. Calpain substratesinclude “PEST” proteins, which have high proline, glutamine, serine, andthreonine contents; calpain and calpastatin; signal transductionproteins including protein kinase C, transcription factors c-Jun, c-Fos,and a-subunit of heterotrimeric G proteins; proteins involved in cellproliferation and cancer including P53 tumor suppressor, growth factorreceptors (eg., epidermal growth factor receptor), c-Jun, c-Fos, andN-myc; proteins with established physiological roles in muscle includingCa⁺⁺-ATPase, Band III, troponin, tropomyosin, and myosin light chainkinase; myotonin protein kinase; proteins with established physiologicalroles in the brain and the central nervous system including myelinproteins, myelin basic protein (MBP), axonal neurofilament protein(NFP), myelin protein MAG; cytosketetal and cell adhesion proteinsincluding troponins, talin, neurofilaments, spectrin, microtubuleassociated protein MAP-2, tau, MAPIB, fodrin, desmin, α-actinin,vimentin, spectrin, integrin, cadherin, filamin, and N-CAM; enzymesincluding protein kinases A and C, and phospholipase C; and histones.

[0332] See Sorimachi et al. (1997) J. Biochem. 328:721-732; Johnson etal. (1997) BioEssays 19(11):1011-1018; Shields et al. (1999) J.Neuroscience Res. 55(5):533-541; and Belcastro et al. (1998) Mol. Cell.Biochem. 179 (1, 2):135-145.

[0333] Calpain is implicated in a wide variety of physiologicalprocesses including alteration of membrane morphology, long-termpotentiation of memory, axonal regeneration, neurite extension, cellproliferation (division), gastric HCl secretion, embryonic development,secretory granule movement, cell differentiation and regulation,cytoskeletal and membrane changes during cell migration, cytoskeletalremodeling, sex determination, and alkaline adaptation in fungi. SeeSolary et al. (1998) Cell Biol. Toxicol. 14:121-132; Sorimachi et al.(1997) J. Biochem. 328:721-732; Johnson et al. (1997) BioEssays19(11):1011-1018; Suzuki et al. (1998) FEBS Letters 433(1, 2):1-4; Franzet al. (1999) Mammalian Genome 10(3):318-321; Shields et al. (1999) J.Neuroscience Res. 55(5):533-541; Schnellmann et al. (1998) Renal Failure20(5):679-686; Banik et al. (1998) Annals New York Acad. Sci.844:131-137; Belcastro et al. (1998) Mol. Cell. Biochem. 179 (1,2):135-145; and McIntosh et al. (1998) J. Neurotrauma 15(10):731-769.

[0334] Under pathological conditions, aberrant regulation and/oractivity of calpain can be detrimental to cells and tissues. In thiscontext, calpains are implicated in a wide variety of disease statesincluding exercise-induced injury and repair; apoptosis including T cellreceptor-induced apoptosis, HIV-infected cell apoptosis,ectoposide-treated cell apoptosis, nerve growth factor deprived neuronalapoptosis; ischemia, such as cerebral and myocardial ischemia; traumaticbrain injury; Alzheimer's disease and other neurodegenerative diseases;demyelinating diseases including experimental allergic encephalomyelitis(EAE) and multiple sclerosis; LGMD2A muscular dystrophy; spinal cordinjury (SCI); cancer; cataract formation; and renal cell death bydiverse toxicants.

[0335] The disclosed invention relates to methods and compositions forthe modulation, diagnosis, and treatment of calpain protease-mediateddisorders. Such disorders include, but are not limited to, disordersassociated with perturbed cellular growth and differentiation;exercise-induced injury and repair; apoptosis including T-cellreceptor-induced apoptosis, HIV-infected cell apoptosis,ectoposide-treated cell apoptosis, nerve growth factor deprived neuronalapoptosis; ischemia; traumatic brain injury; Alzheimer's disease andother neurodegenerative diseases; demyelinating diseases includingexperimental allergic encephalomyelitis (EAE) and multiple sclerosis;LGMD2A muscular dystrophy; spinal cord injury (SCI); proliferativedisorders or differentiative disorders such as cancer, e.g., melanoma,prostate cancer, cervical cancer, breast cancer, colon cancer, orsarcoma; and renal cell death associated with diverse toxicants.

[0336] The sequences of the invention find use in diagnosis of disordersinvolving an increase or decrease in protease expression relative tonormal expression, such as a proliferative disorder, a differentiativedisorder, or a developmental disorder. The sequences also find use inmodulating protease-related responses. By “modulating” is intended theupregulating or downregulating of a response. That is, the compositionsof the invention affect the targeted activity in either a positive ornegative fashion.

[0337] One embodiment of the invention features protease nucleic acidmolecules, preferably human protease molecules, which were identifiedbased on a consensus motif or protein domain characteristic of thecalpain family of thiol proteases. Specifically, a novel human gene,termed clone 26176, is provided. This sequence, and other nucleotidesequences encoding the 26176 protein or fragments and variants thereof,are referred to as “calpain protease sequences” indicating that thesequences share sequence similarity to other calpain protease genes.

[0338] The calpain protease gene designated clone 26176 was identifiedin a human T-cell cDNA library. Clone 26176 encodes an approximately3.78 Kb mRNA transcript having the corresponding cDNA set forth in SEQID NO:49. This transcript has a 2439 nucleotide open reading frame(nucleotides 276-2714 of SEQ ID NO:49; nucleotides 1-2439 of SEQ IDNO:51), which encodes an 813 amino acid protein (SEQ ID NO:50). MEMSATanalysis of the full-length 26176 polypeptide predicts a transmembranesegment from amino acids (aa) 286-302 of SEQ ID NO:50. Prosite programanalysis was used to predict various sites within the 26176 protein. AnN-glycosylation site was predicted at aa 366-369 of SEQ ID NO:50 withthe actual residue being the first residue. A cAMP- and cGMP-dependentprotein kinase phosphorylation site was predicted at aa 759-762 of SEQID NO:50 with the actual phosphorylated residue being the last residue.Protein kinase C phosphorylation sites were predicted at aa 165-167,215-217, 251-253, 281-283, 422-424, 594-596, 668-670, 689-691, and710-712 of SEQ ID NO:50 with the actual phosphorylated residue being thefirst residue. Casein kinase II phosphorylation sites were predicted ataa 4-7, 48-51, 123-126, 205-208, 373-376, 393-396, 445-448, 490-493,523-526, 551-554, 594-597, 657-660, 748-751, and 761-764 of SEQ ID NO:50with the actual phosphorylated residue being the first residue. Tyrosinekinase phosphorylation sites were predicted at aa 20-26 and aa 320-326of SEQ ID NO:50 with the actual phosphorylated residue being the last.N-myristoylation sites were predicted at aa 201-206, 390-395, 453-458,630-635, and 698-703 of SEQ ID NO:50 with the actual modified residuebeing the first. An amidation site was predicted at aa 614-617 of SEQ IDNO:50. The calpain protease protein 26176 possesses a calpain familycysteine protease domain (domain II), from aa 231-537 of SEQ ID NO:50,and a calpain large subunit domain III, from aa 685-810 of SEQ ID NO:50,as predicted by HMMer, Version 2.

[0339] The protein displays the closest similarity to the human genedesignated PaIBH, (Accession Numbers GPU:gi [5102944] dbj [BAA78730](AB028639).

[0340] The 26176 protein also displays similarity to the murine CAPN7protein, approximately 93% identity and 95% overall similarity over a768 amino acid overlap (amino acid residues 45-813 of the 26176 protein(SEQ ID NO:50)), indicating 26176 is the human ortholog of this murineprotein.

[0341] A plasmid containing the 26176 cDNA insert was deposited with thePatent Depository of the American Type Culture Collection (ATCC), 10801University Boulevard, Manassas, Va., on Apr. 6, 2000, and assignedPatent Deposit Number PTA-1649. This deposit will be maintained underthe terms of the Budapest Treaty on the International Recognition of theDeposit of Microorganisms for the Purposes of Patent Procedure. Thisdeposit was made merely as a convenience for those of skill in the artand is not an admission that a deposit is required under 35 U.S.C. 112.

[0342] The calpain protease sequences of the invention are members of aprotease family of molecules having conserved functional features. Theterm “family” when referring to the proteins and nucleic acid moleculesof the invention is intended to mean two or more proteins or nucleicacid molecules having sufficient amino acid or nucleotide sequenceidentity as defined herein. Such family members can be naturallyoccurring and can be from either the same or different species. Forexample, a family can contain a first protein of murine origin and anortholog of that protein of human origin, as well as a second, distinctprotein of human origin and a murine ortholog of that protein. Membersof a family may also have common functional characteristics.

[0343] Preferred 26176 calpain protease polypeptides of the presentinvention have an amino acid sequence sufficiently identical to theamino acid sequence of SEQ ID NO:50. The term “sufficiently identical”is used herein to refer to a first amino acid or nucleotide sequencethat contains a sufficient or minimum number of identical or equivalent(e.g., with a similar side chain) amino acid residues or nucleotides toa second amino acid or nucleotide sequence such that the first andsecond amino acid or nucleotide sequences have a common structuraldomain and/or common functional activity. For example, amino acid ornucleotide sequences that contain a common structural domain having atleast about 45%, 55%, or 65% identity, preferably 75% identity, morepreferably 85%, 95%, or 98% identity are defined herein as sufficientlyidentical.

[0344] Another embodiment of the invention features isolated calpainprotease proteins and polypeptides having a calpain protease proteinactivity. As used interchangeably herein, a “calpain protease proteinactivity”, “biological activity of a calpain protease protein”, or“functional activity of a calpain protease protein” refers to anactivity exerted by a calpain protease protein, polypeptide, or nucleicacid molecule on a calpain-protease-responsive cell as determined invivo, or in vitro, according to standard assay techniques. A calpainprotease activity can be a direct activity, such as an association withor an enzymatic activity on a second protein, or an indirect activity,such as a cellular signaling activity mediated by interaction of thecalpain protease protein with a second protein. In a preferredembodiment, a 26176 calpain protease activity includes at least one ormore of the following activities: (1) modulating (stimulating and/orenhancing or inhibiting) cellular proliferation, differentiation, and/orfunction (e.g., in cells in which it is expressed, for example, cellswithin normal and carcinoma tissues, such as lung, liver, colon, andbreast; brain and skeletal muscle cells, etc.); (2) modulating a calpainprotease response; (3) modulating the entry of cells into mitosis; (4)modulating cellular differentiation; and (5) modulating cell death.

[0345] Isolation of 26176

[0346] Clone 26176 was isolated from a human T-cell cDNA library. Theidentified clone 26176 encodes a transcript of approximately 3.78 Kb(corresponding cDNA set forth in SEQ ID NO:49). The open reading frame(nucleotides 276-2714 of SEQ ID NO:49; nucleotides 1-2439 of SEQ IDNO:51) of this transcript encodes a predicted 813 amino acid protein(SEQ ID NO:50)

[0347] A search of the nucleotide and protein databases revealed that26176 encodes a polypeptide that shares similarity with several calpainproteases, the greatest similarity being seen with the murine CAPN7protein (EMB Accession Number AJ012475).

[0348] mRNA Expression of Clone 26176

[0349] Expression of the novel 26176 calpain protease was measured byTaqMan quantitative PCR (Perkin Elmer Applied Biosystems) in cDNAprepared from the following human tissues: normal colon, coloncarcinoma, normal liver, colon metastasis, normal lung, lung carcinoma,normal breast, and breast carinoma.

[0350] Probes were designed by PrimerExpress software (PE Biosystems)based on the 26176 sequence. The primers and probes for expressionanalysis of 26176 and P-2 microglobulin were as follows: 26176 ForwardPrimer AATAGTATCGGATTGCTCCTTTGTG 26176 Reverse PrimerGCCGGTAATTAACTTCTTATTAAAACG 26176 TaqMan ProbeCATCACTGGCCATCAGTGCAGCTTATG β-2 microglobulin Forward PrimerCACCCCCACTGAAAAAGATGA β-2 microglobulin Reverse PrimerCTTAACTATCTTGGGCTGTGACAAAG β-2 microglobulin TaqMan ProbeTATGCCTGCCGTGTGAACCACGTG

[0351] The 26176 sequence probe was labeled using FAM(6-carboxyfluorescein), and the β2-microglobulin reference probe waslabeled with a different fluorescent dye, VIC. The differential labelingof the target calpain protease sequence and internal reference gene thusenabled measurement in the same well. Forward and reverse primers andthe probes for both β2-microglobulin and the target 26176 sequence wereadded to the TaqMan® Universal PCR Master Mix (PE Applied Biosystems).Although the final concentration of primer and probe could vary, eachwas internally consistent within a given experiment. A typicalexperiment contained 200 nM of forward and reverse primers plus 100 nMprobe for β-2 microglobulin and 600 nM forward and reverse primers plus200 nM probe for the target 26176 sequence. TaqMan matrix experimentswere carried out on an ABI PRISM 7700 Sequence Detection System (PEApplied Biosystems). The thermal cycler conditions were as follows: holdfor 2 min at 50° C. and 10 min at 95° C., followed by two-step PCR for40 cycles of 95° C. for 15 sec followed by 60° C. for 1 min.

[0352] The following method was used to quantitatively calculate 26176expression in the various tissues relative to β-2 microglobulinexpression in the same tissue. The threshold cycle (Ct) value is definedas the cycle at which a statistically significant increase influorescence is detected. A lower Ct value is indicative of a highermRNA concentration. The Ct value of the 26176 sequence is normalized bysubtracting the Ct value of the β-2 microglobulin gene to obtain a_(Δ)Ct value using the following formula: _(Δ)Ct=Ct_(h26176)−Ct_(β-2)microglobulin. Expression is then calibrated against a cDNA sampleshowing a comparatively low level of expression of the 26176 sequence.The _(Δ)Ct value for the calibrator sample is then subtracted from_(Δ)Ct for each tissue sample according to the following formula:_(ΔΔ)Ct=_(Δ)Ct−_(sample)−_(Δ)Ct−_(calbrator). Relative expression isthen calculated using the arithmetic formula given by 2^(−ΔΔCt).Expression of the target 26176 sequence in each of the tissues testedwas then analysed.

[0353] The mRNA for the putative calpain protease 26176 is expressed ina variety of tumors. There was significant upregulation in coloncarcinoma and breast carcinoma. Accordingly, expression of the 26176calpain protease is relevant to colon and breast carcinoma. Inadditional experiments, the gene was expressed in three out of fournormal lung tissue samples but in 15 out of 16 lung carcinoma clinicalsamples. Accordingly, expression of the 26176 calpain protease isrelevant to lung carcinoma as well. This is consistent with thehypothesis that proteases may function in carcinogenesis by inactivatingor activating regulators of cell cycle, differentiation, apoptosis, orother processes affecting cancer development and/or progression. In viewof the fact that the 26176 gene is upregulated in colon carcinoma, thegene is useful for inhibiting tumor progression. Inhibition ofexpression of this 26176 protease can thus be used to decrease theprogression of carcinogenesis.

[0354] In addition, Northern blot experiments showed expression of the26176 calpain protease in bone, ovary, T-cell, spleen, and kidneytissue. Accordingly, the 26176 protease is relevant to disordersinvolving these tissues.

[0355] In addition, 26176 expression has been observed in heart,neuronal tissue, monocytes, and prostate. Accordingly, expression of the26176 gene is relevant to disorders involving these tissues.

[0356] Finally, 26176 expression has been observed in parathyroid tumorand in thymus. Accordingly, detection of expression or modulation ofexpression of the 26176 gene in these tissues, and particularly indisorders involving these tissues, is relevant.

[0357] Human 26343

[0358] The present invention is based, at least in part, on thediscovery of novel molecules, referred to herein “OxidoreductaseProtein”, “OP” or “26343” nucleic acid and protein molecules, which arenovel members of a family of enzymes possessing oxidoreductase activity.These novel molecules are capable of oxidizing and/or reducing moleculargroups by catalyzing the transfer of a hydride moiety and, thus, play arole in or function in a variety of cellular processes, e.g.,proliferation, metabolism, differentiation, hormonal responses, andinter- or intra-cellular communication.

[0359] The oxidation and reduction of molecules is of criticalimportance in many cellular metabolic and catabolic pathways. “Redox”reactions play important roles in the production and breakdown of nearlyall major metabolic intermediates, including amino acids, vitamins,energy molecules (e.g., glucose, sucrose, and their breakdown products),signal molecules (e.g., transcription factors and neurotransmitters),and nucleic acids. A large class of enzymes which facilitate some ofthese molecular alterations, termed oxidoreductases, have beenidentified. In the forward reaction, these enzymes catalyze the transferof a hydride ion from the target substrate to the enzyme or a cofactorof the enzyme (e.g., NAD⁺, NADP⁺, FAD⁺), thereby oxidizing thesubstrate. These enzymes may also participate in the reverse reaction,wherein a molecular group of the target molecule is reduced by thetransfer of a hydride group from the enzyme. Members of theoxidoreductases family are found in nearly all organisms, fromprokaryotes to Drosophila to humans. Both between species and within thesame species, oxidoreductases vary widely; disparate family members arefrequently classified by the cofactor used by the enzyme (e.g., NAD⁺,NADP⁺, FAD⁺), or by the particular substrate(s) of the enzyme (see, forexample, Cavener, D. R. (1992) J. Mol. Biol. 223:811-814).

[0360] Different oxidoreductases are specific for a wide array ofbiological and chemical substrates. For example, there existoxidoreductases specific for steroids (Kass and Sampson (1998)Biochemistry 37:17990-800), neurotransmitters (Lamark et al. (1991) Mol.Microbiol. 5:1049-1064), energy metabolites (Krasney et al. (1990) Mol.Biol. Evol. 7:155-177; Frederick), alcohols (Ledeboer et al. (1985)Nucleic Acids Res. 13:3069-3082; Koutz et al. (1989) Yeast 5:167-177),lipids (Funk et al. (1992) Proc. Natl. Acad. Sci. USA 89:3962-3966),amino acid precursors and nucleotide precursors (Wright et al. (1993)Proc. Natl. Acad. Sci. USA 90:10690-10694). Accordingly, oxidoreductaseactivity contributes to the ability of the cell to grow anddifferentiate, to proliferate, and to communicate and interact withother cells. Therefore, a wide range of metabolic disorders and relatedpathogenic states relate to the oxidoreductases, both directly andindirectly (see, for example, Salazar et al. (1997) J. Biol. Chem.272:26425-26433).

[0361] As used herein, the term “oxidoreductase” includes a moleculewhich is involved in the oxidation or reduction of a biochemicalmolecule (e.g., a metabolic precursor which contains a molecular groupwhich can be oxidized or reduced) by catalyzing the transfer of ahydride ion to or from the biochemical molecule. Oxidoreductasemolecules are involved in the metabolism and catabolism of biochemicalmolecules necessary for energy production or storage, for intra- orinter-cellular signaling, and for metabolism or catabolism ofmetabolically important biomolecules. Examples of oxidoreductasesinclude glucose oxidases, methanol oxidases, choline dehydrogenases,glucose dehydrogenases, cholesterol oxidases, alcohol dehydrogenases,and cellobiose dehydrogenases.

[0362] The OP proteins of the present invention show homology to thecholine dehydrogenase family of oxidoreductases. Choline dehydrogenase(CDH) is the first enzyme of the glycine betaine synthetic pathway.Betaine, an a typical amino acid that is non-proteinogenic yet importantas an osmoprotectant, is synthesized by a two-step oxidation of choline.This reaction takes place in the mitochondrial matrix by the membranebound CDH and betaine aldehyde dehydrogenase (Landfald and Strom (1986)J. Bacteriol. 165:849 -55; Styrvold et al. (1986) J. Bacteriol.165:856-63; Grossman and Hebert (1989) Am. J. Physiol. 256(1 Pt.2):F107-12; Zhang et al. (1992) Biochim. Biophys. Acta. 1117:333-9). CDHis also coupled to the respiratory chain. Betaine is further importantin mammalian organisms as a major methyl group donor and nitrogensource.

[0363] Methyl groups derived from betaine may be used for recyclinghomocysteine to methionine. It is known that some tumor cells have anincreased need for methionine for survival. Methionine dependent tumorcells are unable to proliferate, and they arrest in the G2 phase of thecell cycle. For example, MCF-7 breast cancer cells grown inmethyl-deficient media show inhibition of cell proliferation andinduction of apoptosis. Fresh patient colon tumors have also been shownto be methionine dependent based on cell cycle analyses. Metastaticcolon tumors have a higher methionine dependence than primary tumors.Other examples of methionine dependence in tumors have been seen insmall cell lung cancer and gliomas.

[0364] Human 26343 is overexpressed in various tumors, e.g., colontumors, as compared to normal tissues (see section below on expressionlevels). Human 26343 is further elevated in later stage tumors.Elevation of the levels of the 26343 molecules of the present inventionin tumor cells may increase tumor survival by increasing the supply ofmethionine available to the tumor cells. Accordingly, inhibition of the26343 molecules of the present invention may cause tumor cell growtharrest and/or apoptosis, making the 26343 molecules of the presentinvention useful for the treatment of cellular proliferation, growth,apoptosis, differentiation, and/or migration disorders.

[0365] The 26343 molecules of the present invention may also be usefulfor the treatment of disorders characterized by the aberrant or abnormalregulation of the levels of choline, betaine (e.g., a disorderassociated with aberrant regulation of osmolarity by betaine),homocysteine (e.g., homocystinuria), and/or methionine in a subject.

[0366] The 26343 molecules of the present invention may still further beuseful for the treatment of disorders affecting tissues in which 26343protein is expressed, e.g., primary osteoblasts, pituitary, CaCO cells,keratinocytes, aortic endothelial cells, fetal kidney, fetal lung,mammary epithelium, fetal spleen, fetal liver, umbilical smooth muscle,RAII Burkitt Lymphoma cells, lung, prostate, K53 red blood cells, fetaldorsal spinal cord, insulinoma cells, normal breast and ovarianepithelia, retina, HMC-1 mast cells, ovarian ascites, d8 dendriticcells, megakaryocytes, human mobilized bone morrow, mammary carcinoma,melanoma cells, lymph, vein, U937/A70p B cells, A549con cells, WT LN Captestosterone cells, esophagus, and other tissues and/or cell typesdescribed further below.

[0367] In an alternate embodiment, any and all of the above describeddisorders may simply be referred to as “OP associated or relateddisorders”.

[0368] For example, the family of OP proteins comprise at least one, andpreferably three or more “transmembrane domains.” As used herein, theterm “transmembrane domain” includes an amino acid sequence of about 15amino acid residues in length which spans the plasma membrane. Morepreferably, a transmembrane domain includes about at least 10, 15, 20,25, 30, 35, 40, 45 or more amino acid residues and spans the plasmamembrane. Transmembrane domains are rich in hydrophobic residues, andtypically have a helical structure. In one embodiment, at least 50%,60%, 70%, 80%, 90%, 95% or more of the amino acid residues of atransmembrane domain are hydrophobic, e.g., leucines, isoleucines,tyrosines, or tryptophans. Transmembrane domains are described in, forexample, Zagotta W. N. et al. (1996) Annu. Rev. Neurosci. 19:235-63, thecontents of which are incorporated herein by reference. Amino acidresidues 41-57, 292-311, and 545-564 of the human 26343 polypeptide (SEQID NO:55) comprise transmembrane domains.

[0369] In another embodiment, an OP molecule of the present invention isidentified based on the presence of an GMC oxidoreductase signaturedomain in the protein or corresponding nucleic acid molecule. As usedherein, the term GMC oxidoreductase signature domain includes a proteindomain having an amino acid sequence of about 375-650, more preferablyabout 450-600 amino acid residues, or most preferably about 500-550amino acids and has a bit score for the alignment of the sequence to theGMC oxidoreductase signature domain (HMM) of at least about 100, 200,300, 400, 500, 600, 700, 800, or more. Preferably, a GMC oxidoreductasesignature domain includes at least about 526 amino acid residues and hasa bit score for the alignment of the sequence to the GMC oxidoreductasesignature domain (HMM) of about 767.7. The GMC oxidoreductase signaturedomain has been assigned the PFAM labels “GMC_oxred_(—)1” and“GMC_oxred_(—)2” under accession number PS00623 and PS00624,respectively (see the Pfam website, available online through WashingtonUniversity in Saint Louis). GMC oxidoreductase signature domains areinvolved in oxidoreductase activity and are described in, for example,Cavener (1992) J. Mol. Biol. 223:811-814, the contents of which areincorporated herein by reference.

[0370] To identify the presence of a GMC oxidoreductase signature domainin an OP protein and make the determination that a protein of interesthas a particular profile, the amino acid sequence of the protein issearched against a database of HMMs (e.g., the Pfam database, release2.1) using the default parameters (see the Pfam website, availableonline through Washington University in Saint Louis). A search wasperformed against the HMM database resulting in the identification of aGMC oxidoreductase signature domain in the amino acid sequence of SEQ IDNO:55 (at about residues 41-567).

[0371] A description of the Pfam database can be found in Sonhammer etal. (1997) Proteins 28:405-420, and a detailed description of HMMs canbe found, for example, in Gribskov et al. (1990) Meth. Enzymol.183:146-159; Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531; andStultz et al. (1993) Protein Sci. 2:305-314, the contents of which areincorporated herein by reference.

[0372] Isolated OP proteins of the present invention, have an amino acidsequence sufficiently identical to the amino acid sequence of SEQ IDNO:55, or are encoded by a nucleotide sequence sufficiently identical toSEQ ID NO:54 or 56. As used herein, the term “sufficiently identical”refers to a first amino acid or nucleotide sequence which contains asufficient or minimum number of identical or equivalent (e.g., an aminoacid residue which has a similar side chain) amino acid residues ornucleotides to a second amino acid or nucleotide sequence such that thefirst and second amino acid or nucleotide sequences share commonstructural domains or motifs and/or a common functional activity. Forexample, amino acid or nucleotide sequences which share commonstructural domains have at least 30%, 40%, or 50% homology, preferably60% homology, more preferably 70%-80%, and even more preferably 90-95%homology across the amino acid sequences of the domains and contain atleast one and preferably two structural domains or motifs, are definedherein as sufficiently identical. Furthermore, amino acid or nucleotidesequences which share at least 30%, 40%, or 50%, preferably 60%, morepreferably 70-80%, or 90-95% homology and share a common functionalactivity are defined herein as sufficiently identical.

[0373] As used interchangeably herein, a “OP activity”, “biologicalactivity of OP,” or “functional activity of OP,” includes an activityexerted by an OP protein, polypeptide or nucleic acid molecule on anOP-responsive cell or tissue, or on an OP protein substrate, asdetermined in vivo, or in vitro, according to standard techniques. Inone embodiment, an OP activity is a direct activity, such as anassociation with an OP-target molecule. As used herein, a “targetmolecule” or “binding partner” is a molecule with which an OP proteinbinds or interacts in nature, such that OP-mediated function isachieved. An OP target molecule can be a non-OP molecule or an OPaccessory polypeptide or molecule of the present invention (e.g., NAD⁺,FAD⁺, or other cofactor). As used herein, an “accessory” peptide ormolecule refers to a peptide or molecule whose presence is may be neededfor the proper activity of a protein (e.g., a cofactor or a metal ionthat is needed by an enzyme). In an exemplary embodiment, an OP targetmolecule is an OP ligand (e.g., choline and/or an acceptor molecule tobe reduced or oxidized choline and/or an acceptor molecule to be reducedor oxidized). Alternatively, an OP activity is an indirect activity,such as a cellular signaling activity mediated by interaction of the OPprotein with an OP ligand. The biological activities of OP are describedherein. For example, the OP proteins of the present invention can haveone or more of the following activities: 1) modulation of metabolism andcatabolism of biochemical molecules, e.g., molecules necessary forenergy production or storage; 2) modulation of betaine synthesis fromcholine; 3) modulation of methionine synthesis from homocysteine; 4)modulation of intra- or inter-cellular signaling; 5) modulation ofcellular proliferation and/or migration; and/or 6) modulation ofhormonal responses.

[0374] Accordingly, another embodiment of the invention featuresisolated OP proteins and polypeptides having an OP activity. Otherpreferred proteins are OP proteins having one or more of the followingdomains: a transmembrane domain, a GMC oxidoreductase signature domain,and, preferably, an OP activity. Additional preferred OP proteins haveat least one GMC oxidoreductase signature domain, and/or at least onetransmembrane domain and are, preferably, encoded by a nucleic acidmolecule having a nucleotide sequence which hybridizes under stringenthybridization conditions to a nucleic acid molecule comprising acomplement of the nucleotide sequence of SEQ ID NO:54 or 56.

[0375] Isolation of the Human 26343 or OP cDNA

[0376] The invention is based, at least in part, on the discovery of a65.3 kD human gene encoding a novel protein, referred to herein as 26343or OP. The entire sequence of the human clone Fbh26343 was determinedand found to contain an open reading frame termed “human OP.” The 2343nucleotide sequence encoding the human OP protein is set forth as SEQ IDNO:54. The protein encoded by this nucleic acid comprises about 594amino acids and has the amino acid sequence set forth as SEQ ID NO:55.The coding region (open reading frame) of SEQ ID NO:54 is set forth asSEQ ID NO:56. Clone Fbh26343, comprising the coding region of human OP,was deposited with the American Type Culture Collection (ATCC®), 10801University Boulevard, Manassas, Va. 20110-2209, on ______, and assignedAccession No. ______. These deposits will be maintained under the termsof the Budapest Treaty on the International Recognition of the Depositof Microorganisms for the Purposes of Patent Procedure. These depositswas made merely as a convenience for those of skill in the art and arenot an admission that a deposit is required under 35 U.S.C. §112.

[0377] Analysis of the Human 26343 or OP Molecule

[0378] A search for domain consensus sequences was performed using theamino acid sequence of human 26343 or OP and a database of HMMs (thePfam database, release 2.1) using the default parameters (describedabove). The search revealed a GMC oxidoreductase signature domain (Pfamlabel GMC_oxred; Pfam Accession Numbers PS00623 and PS00624) within SEQID NO:55 at residues 41-567.

[0379] A search was performed against the ProDom database resulting inthe identification of a portion of the deduced amino acid sequence ofhuman 26343 or OP (SEQ ID NO:55) which has a 39% identity to ProDomentry “FAD flavoprotein oxidoreductase precursor dehydrogenase lyasesignal protein cellobiose isoform”) over residues 41 to 351 and 37%identical over residues 488-568. In addition, human 26343 or OP is 50%identical to ProDom entry “L-sorbosone dehydrogenase, FAD dependent”over residues 501-573 of SEQ ID NO:55. In addition, human 26343 or OP is57% identical to ProDom entry “NADH:N-amido-scyllo-inosamineoxidoreductase” over residues 40-74 and 32% identical over residues 254to 308 of SEQ ID NO:55.

[0380] A search was also performed against the Prosite database, andresulted in the identification of one possible glycosaminoglycanattachment site within the human OP protein at residues 308-311 of SEQID NO:55. In addition, protein kinase C phosphorylation sites wereidentified within the human 26343 or OP protein at residues 8183, 85-87,283-285, 494-496, 515-517, and 592-594 of SEQ ID NO:55. This search alsoidentified casein kinase II phosphorylation sites at residues 37-40,231-234, 415-418, 455-458, 494-497 of SEQ ID NO:55. A tyrosinephosphorylation site motif was also identified in the human 26343 or OPprotein at residues 503-510 of SEQ ID NO:55. The search also identifiedthe presence of N-myristoylation site motifs at residues 20-25, 47-52,129-134, 296-301, 309-314, 329-334, 374-379, and 429-434 of SEQ IDNO:55. In addition, the search identified an amidation site at residues234-237, and a GMC oxidoreductase signature sequence at amino acids297-311 of SEQ ID NO:55.

[0381] An analysis of the possible cellular localization of the human26343 or OP protein based on its amino acid sequence was performed usingthe methods and algorithms described in Nakai and Kanehisa (1992)Genomics 14:897-911, and available online through the PSORT serverwebsite. The results from this analysis predict that the human 26343 orOP protein is found in the mitochondria, in the cytoplasm, in thenucleus, and in peroxisome.

[0382] An analysis of putative post-translationally truncated variantsindicated that the mature protein may have residue 16 of SEQ ID NO:55(arginine) as the N-terminal residue.

[0383] Analysis of Human 26343 or OP Expression

[0384] The following describes the expression of human 26343 or OP mRNAin various tissues, tumors, cell lines, and disease models, asdetermined using the TaqMan™ procedure and in situ hybridizationanalysis.

[0385] For in situ analysis, various tissues, e.g., tissues obtainedfrom liver or colon, were first frozen on dry ice.

[0386] As indicated by the data obtained from the TaqMan analysis, human26343 was expressed highly in the following tissues: normal fetal heart,normal brain cortex, brain (hypothalamus), brain (glioblastoma), normalbreast, breast tumor (IDC), prostate tumor, colon tumor, normal kidney,normal liver, fibrotic liver, normal fetal liver, and skeletal muscle.Human 26343 is also expressed in the following tissues: normal heart,heart (congestive heart failure), normal spinal cord, normal prostate,normal ovary, and lung (chronic obstructive pulmonary disease).

[0387] Human 26343 showed increased expression in 100% of the clinicalcolon tumor samples tested, compared with clinical normal colon tissuesamples.

[0388] Human 26343 showed increased expression in 100% of the clinicalliver metastasis samples tested, compared with clinical normal livertissue samples.

[0389] Human 26343 showed increased expression in 57% of the clinicallung tumors tested, compared with clinical normal lung tissue samples.

[0390] Human 26343 showed expression in most Xenograft friendly celllines, e.g., MCF-7, ZR75, T47D, DLD-1, SW 480, SW 620, HCT 116, Colo205, NCIH 125, NCIH 322, NCIH 460, and A549. Colon tumor cell lines showincreased 26343 expression in later stages as follows: Cell line StageRelative Expression SW 480 B 8.0 HCT 116 B/C 20.6 DLD-1 C 19.9 Colo 205Ascites 62.5 SW 620 Lymph Metastasis 104.7

[0391] The results from the in situ hybridization analysis indicate thathuman 26343 is expressed in 100% of primary colon tumors tested and 100%of metastatic tumors tested, as compared to 0% in normal tissues tested.

[0392] The data also indicate that human 26343 is focally expressed in20% of lung tumors tested, as compared to 0% of the corresponding normaltissues.

[0393] Cell Cycle Analysis

[0394] The following describes the results from studies designed todetermine how the expression of human 26343 mRNA is regulated during thecell cycle.

[0395] Transcriptional profiling analysis showed that human OPexpression was increased in aphidocholine synchronized MCF10a cellswithin the G0/G1 phase of the cell cycle.

[0396] Human 26343 also showed cell cycle regulated expression inaphidocholine synchronized HCT 116 colon carcinoma cells, with higherexpression in the G2/M phase of the cell cycle.

[0397] Human 26343 also showed cell cycle regulated expression inaphidocholine synchronized A549 lung carcinoma cells.

[0398] Reintroduction of Smad4, a tumor supressor gene in the TGFβsignaling pathway, into SW 480 cells (colon carcinoma cells that aredeficient in the expression of Smad4) by transient transfection caused adecrease in the expression of human 26343 in these cells.

[0399] Human 26343 expression was upregulated in the RER− (replicationerror) cell lines Caco2 and SW 480, as compared to RER+ cell lines. RER−cell lines have increased difficulty in mismatch repair during DNAreplication.

[0400] Increased expression of human 26343 in RER− cells and in Smad4deficient cells indicates that increased human 26343 expression isassociated with situations known to cause progression to later stagetumors, i.e., errors in TGFβ signaling and mismatch repair.

[0401] Measurment of Methionine Levels in Tumor Cells

[0402] The following describes the measurement of methionine levels intumor cells, as may be determined using the methods of Tan, Y. et al.(1999) Clin. Cancer Res. 5:2157-2163, the contents of which areincorporated herein by reference.

[0403] Briefly, tumor methionine levels are determined using an HPLCmachine (Hitachi L-5200A Intelligent pump; Hitachi, Ltd., Tokyo, Japan)after derivitization of serum amino acids with the fluoraldehyde reagentOPA as described in Tan, Y. et al. (1997) Anticancer Res. 17:3857-3860and Lishko, V. K. et al. (1993) Anticancer Res. 13:1465-1468.Supernatants are prepared from tumor tissue after sonication for 30seconds and subsequent centrifugation at 13,000 rpm for 10 minutes.Tumor supernatant samples (25 μl) are precipitated by acetonitrile (75μl). Ten μl of supernatant are mixed with 5 μl of OPA. After 1 minute,50 μl of 0.1 M sodium acetate (pH 7.0) are added, and a 20 μl sample isloaded on a reversed-phase Supelcosil LC-18-DB column (particle size: 5μm, 25 cm×4.8 mm) at room temperature. The column is eluted withsolution A (tetrahydrofuran:methanol:0.1 M sodium acetate (pH 7.2);5:95:900) and solution B (methanol). A gradient from 20-60% of solutionB, run a flow rate of 1.5 ml/min, resolves the amino acids. The eluateis read with a fluorescence spectrophotometer (Hitachi, F1000) at awavelength of 350-450 nm. The limit of detection is ˜0.1 μM methionine.

[0404] Measurment of OP Choline Dehydrogenase Activity

[0405] The following describes the measurement of OP cholinedehydrogenase activity in cells, as may be determined using the methodsof Zhang, J. et al. (1992) Biochim. Biophys. Acta 1117:333-339, thecontents of which are incorporated herein by reference.

[0406] The following methods are used to assay the choline dehydrogenaseactivity of the OP molecules of the invention. The methods are performedwith purified OP molecules, or with mitochondrial preparationscontaining OP molecules, as described below.

[0407] Preparation of Mitochondria

[0408] A 12 gram wet weight tissue or cell sample (e.g., a normal tissueor cell sample, or a tumor sample) is homogenized in 108 ml 0.25 Msucrose at a temperature of not more than 4° C. and centrifuged at 700×gat 4° C. for 8 minutes. The supernatant is subsequently centrifuged at17,000×g at 4° C. for 10 minutes. The resulting mitochondrial pellet isresuspended in 30 ml of 0.25 M sucrose and repeatedly treated as aboveat least three times. The purity of the mitochondria is confirmed bydetermining the activities of a mitochondrial marker, fumarase (Stenech,J. (1984) in Experimental Biochemistry (Stenech, J., ed.), pp. 400-401,Allyn and Bacon, Boston); a cytosolic marker, lactate dehydrogenase(Worthington Biochemicals, Freehold, N.J.); and a microsomal marker,glucose-6-phosphatase (Leloir, L. F. and Cardini, C. E. (1975) MethodsEnzymol. 3:840-844). This preparation is kept frozen at −90° C. untilused. The protein concentration of the mitochondria is determined by themethod of Bradford ((1976) Anal. Biochem. 72.248-254). Colorimetric OPcholine dehydrogenase assay

[0409] OP choline dehydrogenase activity may be measured by the PMS-DCIPcolorimetric method, as described in Singer, T. P. (1974) in Methods ofBiochemical Analysis (Glick, D., ed.), Vol. 22, pp. 133-169, John Wiley,New York; and Rendina, G. and Singer, T. P. (1959) J. Biol. Chem.234:1605-1610.

[0410] Radioenzymatic Assay of OP Choline Dehydrogenase Activity

[0411] A mitochondrial preparation containing OP molecules, made usingthe methods described above, is incubated with [methyl-¹⁴C]choline (55mCi/mmol; ICN Biomedicals, Irvine, Calif.) in reaction medium containing40 mM Tris buffer (pH 7.6) or 40 mM glycine buffer (pH 8.5) for varyingamounts of time at 37° C. The reaction is inactivated by adding 1/10 ofthe reaction volume in the form of 1.2 M HCl. Mixtures are extractedwith one reaction volume of methanol and 2 volumes of chloroform. Afterbriefly vortexing at room temperature, the phases are separated by lowspeed centrifugation and collected.

[0412] HPLC Purification of Choline, Betaine Aldehyde, and Betaine

[0413] 50 μl of the methanol-water phase (see above) is mixed with 100μl of methanol and then analyzed by HPLC (3×8C Pecosphere Cartridge,silica column (Perkin Elmer, Norwalk, Conn.)). The reaction products areeluted (at a flow rate of 1.5 ml/minute) with buffer A containing 800 mlacetonitrile, 68 ml ethanol, 5 ml of 3:2 (v/v) 1.0 M ammoniumacetate-glacial acetic acid buffer, 127 ml water, and 10 ml 1.0 Mpotassium dihydrogen phosphate. The radioactivity of the eluent isdetermined using an on-line solid scintillant radiometric detector(Model BL 507A, Berthold, Nashua, N.H.). The efficiency of detection isdetermined using radiolabeled standards. After each sample is run, thecolumn is washed for 5 minutes with buffer B containing the samecomponents as buffer A in the following volumes (ml): 400:68:132:400:10.In a typical chromatogram, only the three peaks of interest aredetected, i.e., the [methyl-¹⁴C]choline substrate, and the two oxidationproducts, betaine aldehyde and betaine (betaine aldehyde is anintermediate in the two-step oxidation process that produces betainefrom choline). The combined radioactivity in these three peaks is takenas 100%. In order to determine the amount of each product formed, thefollowing formula is used:${{moles}\quad {of}\quad {product}} = \frac{{DPM}\quad {in}\quad {peak}}{{total}\quad {DPM}}$moles  choline  substrate  in  incubation  medium  

[0414] Partial Purification of OP Choline Dehydrogenase

[0415] Mitochondria are centrifuged at 17,000×g at 4° C. to remove theoriginal buffer. The mitochondrial pellet is resuspended in 1.2 Msucrose, 0.05 mM EDTA, and 40 mM ammonium acetate for 45 minutes at 25°C. The resulting preparation is centrifuged at 24,000×g at 4° C. for 10minutes (Beckman, Ti 50.2 rotor). The resulting pellet is referred tointerchangeably herein as an “aged mitochondrial pellet” or “agedmitochondria” (Lin, C. S. and Wu, R. D. (1986) J. Prot. Chem.5:193-200).

[0416] The aged mitochondrial pellet is resuspended by gentle stirringin 60 mM glycine-NaOH buffer (pH 10) at 4° C. for 40 minutes. Theresulting preparation is centrifuged at 24,000×g at 4° C. for 10minutes. The supernatant is discarded after centrifugation.

[0417] The pellet is extracted with 0.2 mg digitonin per mg protein.Digitonin is dissolved in 0.25 M warm sucrose and sonicated with a probesonicator for 1-2 minutes and then chilled and gently added drop-wise tothe mitochondrial preparation over a 5 minutes period. After incubationfor 25 minutes at 4° C., the preparation is centrifuged at 24,000×g at4° C. for 10 minutes. The pellet is resuspended in 0.25 M NaCl (Lin andWu (1986) supra).

[0418] The digitonin-extracted mitochondrial preparation in 0.25 M NaClis sonicated at 4° C. for 5 minutes and subsequently centrifuged at100,000×g at 4° C. for 30 minutes (Beckman, Ti 50.2 rotor). The pelletis resuspended in a buffer containing 0.12 M sucrose, 0.05 mM EDTA, 6.0mM choline (Fisher, Springfield, N.J.; recrystallized in methanol), 0.03M potassium phosphate, and 1.0 M NaCl. Lubrol WX (0.2 mg per mg protein;Serva, Feinbiochemica, Heidelberg, Germany) is added, the preparation isshaken for 10 minutes at 4° C. and subsequently centrifuged at 100,000×gfor 30 minutes. The solubilized OP molecules are present in thesupernatant (Lin and Wu (1986) supra).

[0419] Thin Layer Chromatography Separation of Choline Oxidation Product

[0420] Choline, betaine aldehyde, and betaine are purified by thin layerchromatography on silica gel plates (LK5D; Whatman Company) developedwith a mixture containing chloroform, methanol, and 0.1 M HCl (65:30:4;v/v) and visualized by staining in iodine vapor.

[0421] Measurement of OP Choline Dehydrogenase Activity

[0422] In one experiment, 62.5 μg OP protein (mitochondrial preparation)is incubated with 0.572 μCi, 0.15 mM [methyl-¹⁴C]choline in Tris buffer(pH 7.6) at 37° C. The total reaction volume is 150 μl. In anotherexperiment, 31.5 μg OP protein (mitochondrial preparation) is incubatedwith 0.27 μCi, 0.13 mM [methyl-¹⁴C]choline in Tris buffer (pH 8.5) at37° C. The total reaction volume is 150 μl. In still another experiment,varying amounts of OP protein (mitochondrial preparation) are incubatedwith 0.92 μCi, 0.2 mM [methyl-¹⁴C]choline in Tris buffer (pH 7.6) for 10minutes at 37° C. The total reaction volume is 150 μl.

[0423] In another experiment, the effect of electron acceptors andcyanide on OP activity is measured. 0.5 mg of OP protein (mitochondrialpreparation) is incubated with 0.1 mM, 0.41 μCi [methyl-¹⁴C]choline inKregs-Hanseleit buffer (pH 7.75), in the presence of 1 mM potassiumcyanide (KCN), phenazine methosulfate (PMS) and dichloroindophenol(DCIP), or PMS, DCIP, and KCN. The total reaction volume is 0.5 ml. Thereaction mixture is incubated at 37° C. for 20 minutes. In anotherexperiment, 11 μg OP protein (solubilized preparation) is incubated with20 nmol, 0.27 μCi [methyl-¹⁴C]choline at 37° C. in 0.01 M KH₂PO₄ (pH7.7) for 40 minutes in the presence or absence of 0.1 mM NAD+, or 1.0 mMPMS, or 0.1 mM PMS and 0.1 mM NAD+together. The total reaction volume is195

[0424] In another experiment, the effect of changes in pH on OP activityis measured. 63 μg OP protein is incubated in 40 mM phosphate, 40 mMglycine, 40 mM Hepes, 40 mM boric acid, or 40 mM Tris buffer atdifferent pH. The [methyl-¹⁴C]choline concentration is 0.8 μCi, 0.19 mMfor Tris-HCl and phosphate buffer, and 0.25 μCi, 0.13 mM for otherbuffers. The reactions are carried out at 37° C. for 10 minutes.

[0425] Human 56638

[0426] The present invention is based, at least in part, on theidentification of a novel neprilysin protease referred to herein as“56638”. The human 56638 sequence (SEQ ID NO:57), which is approximately2953 nucleotides long including untranslated regions, contains apredicted methionine-initiated coding sequence of about 2340nucleotides, including the stop codon (SEQ ID NO:59). Although the ATGat position 1-3 of SEQ ID NO:59 is the preferred start site oftranslation, other embodiments are included wherein, e.g., the ATG atposition 28-30 of SEQ ID NO:59 is the start site of translation. Thecoding sequence encodes an 779 amino acid protein (SEQ ID NO:58). Thehuman 56638 protein of SEQ ID NO:58 is predicted to have a signalpeptide at about amino acid 1-44 of SEQ ID NO:58.

[0427] Human 56638 sequence contains the following regions or otherstructural features: an M13 peptidase (neprilysin) domain (PF01431) fromabout amino acid 572 to 778 of SEQ ID NO:58, which includes thecharacteristic HEXXH zinc-binding active site of metallopeptidases(PS00142; SEQ ID NO:62) located at about amino acid 610 to 619 of SEQ IDNO:58.

[0428] The human 56638 sequence can additionally include: eightN-glycosylation sites (PS00001) located from about amino acid 156 to159, from about amino acid 177 to 180, from about amino acid 207 to 210,from about amino acid 243 to 246, from about amino acid 350 to 353, fromabout amino acid 530 to 533, from about amino acid 638 to 641, and fromabout amino acid 657 to 660 of SEQ ID NO:58; one cAMP and cGMP-dependentprotein kinase phosphorylation site (PS00004) from about amino acid 183to 186 of SEQ ID NO:58; eleven protein kinase C phosphorylation sites(PS00005) from about amino acid 158 to 160, from about amino acid 244 to246, from about amino acid 269 to 271, from about amino acid 361 to 363,from about amino acid 391 to 393, from about amino acid 412 to 414, fromabout amino acid 493 to 495, from about amino acid 503 to 505, fromabout amino acid 551 to 553, from about amino acid 726 to 728, and fromabout amino acid 735 to 737 of SEQ ID NO:58; eight casein kinase IIphosphorylation sites (PS00006) from about amino acid 137 to 140, fromabout amino acid 158 to 161, from about amino acid 179 to 182, fromabout amino acid 429 to 432, from about amino acid 445 to 448, fromabout amino acid 482 to 485, from about amino acid 503 to 506, and fromabout amino acid 673 to 676 of SEQ ID NO:58; three tyrosine kinasephosphorylation sites (PS00007) from about amino acid 435 to 442, fromabout amino acid 520 to 526, and from about amino acid 645 to 653 of SEQID NO:58; nine N-myristoylation sites (PS00008) from about amino acid 9to 14, from about amino acid 44 to 49, from about amino acid 78 to 83,from about amino acid 93 to 98, from about amino acid 547 to 552, fromabout amino acid 608 to 613, from about amino acid 683 to 688, fromabout amino acid 706 to 711, and from about amino acid 750 to 755 of SEQID NO:58; a prenyl group binding site (CAAX box) (PS00294) from aboutamino acid 776 to 779 of SEQ ID NO:58; and a signal peptide from aboutamino acid 1 to 44 of SEQ ID NO:58, resulting in a mature protein of 822amino acids, from amino acid 45 to 779 of SEQ ID NO:58.

[0429] Polypeptides of the invention include fragments which include:all or part of a hydrophobic sequence, e.g., the sequence of 560-570 ofSEQ ID NO:58; all or part of a hydrophilic sequence, e.g., the sequenceof 620-640 of SEQ ID NO:58; a sequence which includes a Cys or aglycosylation site.

[0430] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0431] A plasmid containing the nucleotide sequence encoding human 56638(clone “Fbh566338FL”) was deposited with American Type CultureCollection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209,on ______ and assigned Accession Number ______. This deposit will bemaintained under the terms of the Budapest Treaty on the InternationalRecognition of the Deposit of Microorganisms for the Purposes of PatentProcedure. This deposit was made merely as a convenience for those ofskill in the art and is not an admission that a deposit is requiredunder 35 U.S.C. §112.

[0432] The 56638 protein contains a significant number of structuralcharacteristics in common with members of the neprilysin family ofmetalloproteases.

[0433] The neprilysin family comprises a number of related enzymes thatshare high structural homology and a common catalytic mechanism thatinvolves cleavage of a protein substrate by hydrolysis of an amide bondthat depends upon the presence of a metal ion, e.g., zinc. Neprilysinsare mammalian membrane metalloproteases which contain the active siteconsensus sequence VxxHExxH (SEQ ID NO:61; amino acids 610 to 619 of SEQID NO:58) found in other zinc metalloproteases. The histidines are twoof the three Zn-coordinating ligands, and the glutamate plays a role incatalysis by polarizing a water molecule. The hydrolysis occurs throughthe formation of a pentacoordinated complex of the metal which includesthe three Zn-coordinating amino acids of the peptidase, the oxygen ofthe scissile bond, and the water molecule that is initially bound to theZn atom. For a review, see, Barrett (1995) Methods in Enzymol248:263-283. In addition, neprilysin family members share otherstructural features. They can be highly glycosylated type II integralmembrane proteins, and they can have a cluster of conserved cysteineresidues following the transmembrane domain which are involved instabilizing the active enzyme through the formation of sulfide bridges(Tanja et al (2000) Biochem Biophys Res Comm 271:565-570).

[0434] The human 56638 proteins of the present invention showsignificant homology to members of the neprilysin metallopeptidasefamily, and in particular, to the mouse NL1/SEP and the rat neprilysinII proteins (Ghaddar et al. (2000) Biochem J 347:419-429; Ikeda et al.(1999) J Biol Chem 274:32469-32477). Like mouse NL1/SEP and ratneprilysin II, 56638 is a secreted protein. 56638 has the characteristicVxxHExxH (SEQ ID NO:61) zinc-binding metallopeptidase consensus sequence(PS00142), located at about amino acid 610 to 619 of SEQ ID NO:58.Neprilysin family members include neprilysin, endothelin convertingenzyme (ECE), Kell Blood group antigen, PEX, and X-converting enzyme(XCE), and soluble secreted endopeptidase (SSE). Examples of substratesof the neprilysin peptidase family include, but are not limited to,neuropeptides involved in pain control, e.g., enkephalin, somatostatin,and substance P; and vasoactive peptides that mediate inflammation andpain, e.g., neurotensin, atrial natriuretic peptide (ANP), neurokinin,tachykinin, bradykinin, and endothelin (Checler et al. (1983) JNeurochem 41:375; Matsas et al. (1983) Proc Natl Acad Sci USA 80:3111;Matsas et al. (1984) Biochem J 223:433; Stepehenson and Kenny (1987)Biochem J 241:237; Turner and Tanzawa (1997) FASEB J 11:355-364). MouseNL1/Sep has been shown to cleave enkephalin in vivo. Enkephalin, a majorsubstrate of neprilysin, is one of several naturally occurringmorphinelike substances released from nerve endings of the centralnervous system and the adrenal medulla. It acts as an analgesic andsedative in the body and appears to affect mood and motivation. Asneprilysin is responsible for the inactivation of enkephalin and otherbioactive peptides involved in inflammation and pain, neprilysins arecritical for the proper function of many physiological systems,including neurotransmission, pain control, inflammatory response, andvascular tone.

[0435] Other neprilysin family members include a marker of common acutelymphoblastic leukemia antigen present at the surface of B cells (Roqueset al. (1993) Pharmacol Rev 45:87), and the Kell blood group antigen(Lee et al. (1999) Proc Natl Acad Sci USA 88:6353-6357). Kell antigensare highly immunogenic and may cause severe fetal anemia in sensitizedmothers, erythroblastosis in newborn infants, and severe hemolyticreactions if mismatched blood is transfused.

[0436] A 56638 polypeptide can include a “neprilysin domain” or regionshomologous with a “neprilysin domain.” A 56638 polypeptide canoptionally further include a signal peptide; at least one, two, three,four, five, six, seven, preferably eight N-glycosylation sites; at leastone cAMP and cGMP-dependent protein kinase phosphorylation site; atleast one, two, three, four, five, six, seven, eight, nine, ten,preferably eleven, protein kinase C phosphorylation sites; at least one,two, three, four, five, six, seven, preferably eight, casein kinase IIphosphorylation sites, at least one, two, preferably three, tyrosinekinase phosphorylation sites; at least one, two, three, four, five, six,seven, eight, preferably nine, N-myristoylation sites; at least oneprenyl group binding site.

[0437] As used herein, the term “neprilysin domain” includes an aminoacid sequence of about 50 to 350 amino acid residues in length, morepreferably about 100 to 300 amino acid residues, or about 200 to 215amino acids, and having a bit score for the alignment of the sequence tothe neprilysin domain (HMM) of at least 100, preferably 150, morepreferably 200, most preferably 250 or more. Preferably, the domainincludes a zinc-binding active site of metallopeptidase domains(PS00142) located at about amino acid 610 to 619 of SEQ ID NO:58. Theneprilysin domain (HMM) has been assigned the PFAM Accession NumberPF01431. An alignment of the neprilysin domain (amino acids 572 to 778of SEQ ID NO:58) of human 56638 with a consensus amino acid sequencederived from a hidden Markov model derived from PFAM (SEQ ID NO:60)yields a bit score for the alignment of 270.4 (E=2.4e−77).

[0438] In a preferred embodiment 56638 polypeptide or protein has a“neprilysin domain” or a region which includes at least about 50 to 350,more preferably about 100 to 300, or 200 to 215 amino acid residues andhas at least about 60%, 70% 80% 90% 95%, 99%, or 100% homology with a“neprilysin,” e.g., the neprilysin domain of human 56638 (e.g., residues572 to 778 of SEQ ID NO:58).

[0439] To identify the presence of a “neprilysin” domain in a 56638protein sequence, and make the determination that a polypeptide orprotein of interest has a particular profile, the amino acid sequence ofthe protein can be searched against a database of HMMs (e.g., the Pfamdatabase, release 2.1) using the default parameters. For example, thehmmsf program, which is available as part of the HMMER package of searchprograms, is a family specific default program for MILPAT0063 and ascore of 15 is the default threshold score for determining a hit.Alternatively, the threshold score for determining a hit can be lowered(e.g., to 8 bits). A description of the Pfam database can be found inSonhammer et al. (1997) Proteins 28(3):405-420 and a detaileddescription of HMMs can be found, for example, in Gribskov et al. (1990)Meth Enzymol 183:146-159; Gribskov et al. (1987) Proc Natl Acad Sci USA84:4355-4358; Krogh et al. (1994) J Mol Biol. 235:1501-1531; and Stultzet al. (1993) Protein Sci 2:305-314, the contents of which areincorporated herein by reference. A search was performed against the HMMdatabase resulting in the identification of a “neprilysin” domain in theamino acid sequence of human 56638 at about residues 572 to 778 of SEQID NO:58. The identified neprilysin domain is depicted in SEQ ID NO:60.

[0440] A 56638 protein can further include a signal peptide, and ispredicted to be a secreted protein. As used herein, a “signal peptide”or “signal sequence” refers to a peptide of about 20 to 60, preferablyabout 30 to 50, more preferably, about 44 amino acid residues in lengthwhich occurs at the N-terminus of secretory and integral membraneproteins and which contains a majority of hydrophobic amino acidresidues. For example, a signal sequence contains at least about 20 to60, preferably about 30 to 50, more preferably, 44 amino acid residues,and has at least about 40-70%, preferably about 50-65%, and morepreferably about 55-60% hydrophobic amino acid residues (e.g., alanine,valine, leucine, isoleucine, phenylalanine, tyrosine, tryptophan, orproline). Such a “signal sequence”, also referred to in the art as a“signal peptide,” serves to direct a protein containing such a sequenceto a lipid bilayer. For example, in one embodiment, a 56638 proteincontains a signal sequence of about amino acids 1 to 44 of SEQ ID NO:58.The “signal sequence” is cleaved during processing of the matureprotein. The mature 56638 protein corresponds to amino acids 45 to 778of SEQ ID NO:58.

[0441] As used herein, a “56638 activity,” “biological activity of56638,” or “functional activity of 56638,” refers to an activity exertedby a 56638 protein, polypeptide or nucleic acid molecule on e.g., a56638-responsive cell or on a 56638 substrate, e.g., a proteinsubstrate, as determined in vivo or in vitro. In one embodiment, a 56638activity is a direct activity, such as an association with a 56638target molecule. A “target molecule” “substrate” or “binding partner” isa molecule with which a 56638 protein binds or interacts in nature. A56638 activity can also be an indirect activity, e.g., a cellularsignaling activity mediated by interaction of the 56638 protein with a56638 binding partner. In an exemplary embodiment, 56638 is an enzymefor an enkephalin substrate.

[0442] Based on the above-described sequence similarities and the tissuedistribution described below, the 56638 molecules of the presentinvention are predicted to have similar biological activities asneprilysin metalloprotease family members. Thus, in accordance with theinvention, a 56638 metalloprotease or subsequence or variant polypeptidemay have one or more domains and, therefore, one or more activities orfunctions characteristic of a neprilysin metalloprotease family member,including, but not limited to, (1) the ability to modulate the activityof a bioactive peptide, (2) the ability to cleave a neprilysinsubstrate, e.g., enkephalin, (3) the ability to modulate pain orinflammation response, (4) the ability to modulate spermatid cellactivity or infertility, or (5) the ability to modulate hematopoieticcell activity, e.g., erythroid cell activity or B cell activity. Thus,the 56638 molecules can act as novel diagnostic targets and therapeuticagents for controlling neprilysin associated disorders.

[0443] Neprilysin is involved in the inactivation of the opioidenkephalins in the brain, which induce analgesic responses. Inhibitorsof neprilysin are thus able to potentiate the analgesic effects ofexogenous enkephalins, as evaluated by analgesic tests on animals, e.g.,the hot plate test, tail flick test, writhing test, paw pressure test,all electric stimulation test, tail withdrawal test, or formalin test(Roques et al. (1995) Methods in Enzymology 248:263-283). Thus, 56638neprilysin or subsequence or variant having neprilysin activity iscapable of cleaving one or more protein substrates, e.g., biologicallyactive neuropeptides, e.g., enkephalin, substance P, or somatostatin, tomodulate pain response.

[0444] Neprilysin family members are also involved in the inflammatoryresponse. Besides enkephalin, other neprilysin substrates includeendothelin (a polypeptide produced by endothelial cells that stimulatescontraction of the underlying smooth muscle of blood vessel walls), andvasoactive peptides that cause vasodilation and pain, e.g., neurotensin,atrial natriuretic peptide (ANP), neurokinin, tachykinin, bradykinin,and endothelin.

[0445] TaqMan analysis revealed that 56638 mRNA is expressed in humanadrenal gland, brain, heart, kidney, liver, lung, mammary gland,placenta, prostate, salivary gland, muscle, small intestine, spleen,stomach, testes, thymus, trachea, uterus, spinal cord, skin, and dorsalroot ganglion (DRG). The highest 56638 mRNA expression was observed intestes, trachea, brain, spinal cord and DRG.

[0446] As 56638 mRNA is highly expressed in human testis, it suggests arole for 56638 in, e.g., fertility or spermatid development. Human 56638appears to be a human orthologue of mouse neprilyisn NL1/SEP and the ratneprilysin II proteins (Ghaddar et al. (2000) Biochem J 347:419-429;Tanja et al. (2000) Biochem Biophys Res Comm 271:565-570). Like 56638,mouse NL1/SEP and rat neprilysin II are highly expressed in testis, andare secreted proteins. The rat and mouse proteins have been localized tothe seminiferous tubules and, specifically, to spermatids (Ibid).Testicular neprilysin enzymes may act to modulate enkephalins acting asintratesticular paracrine/autocrine factors. Thus, the 56638 moleculescan act as novel diagnostic targets and therapeutic agents controllingsperm formation or other processes related to fertility, e.g.,spermatogenesis or fertilization.

[0447] As 56638 mRNA is highly expressed in human trachea, it alsosuggests a role for 56638 in modulation of the activity of bioactivepeptides in the trachea, bronchus, and lung. Thus, the 56638 moleculescan act as novel diagnostic targets and therapeutic agents controllingrespiratory disorders, e.g., chronic obstructive pulmonary disease,emphysema, amyloidosis, lung disease, lung cancer, sleep apnea,bronchitis, pneumonias, silicosis, pulmonary edema, interstitialrestrictive lung diseases, pulmonary embolus, or pulmonary hypertension.

[0448] 56638 mRNA is also highly and widely expressed in the central andperipheral nervous system. More specifically, high levels of 56638 mRNAexpression were found in human brain, spinal cord and DRG. Taqmanexperiments in rat showed that 56638 is expressed in pituitary gland,spinal cord, brain, nerve, TRG, and DRG. In situ hybridization with a56638 probe shows that 56638 is heterogeneously expressed in monkey CNS,including expression in cerebral cortex, spinal cord, brain stem nucleusand hypothalamus. Hence, 56638 is likely a neuropeptidase, e.g., aneuropeptidase involved in pain response.

[0449] Animal models of pain response include, but are not limited to,axotomy, the cutting or severing of an axon; chronic constriction injury(CCI), a model of neuropathic pain which involves ligation of thesciatic nerve in rodents, e.g., rats; or intraplantar Freund's adjuvantinjection as a model of arthritic pain. Other animal models of painresponse are described in, e.g., ILAR Journal (1999) Volume 40, Number 3(entire issue). Taqman experiments on rodent models of pain responseshowed that the 56638 gene is up-regulated in DRG seven days afteraxotomy and seven days after CCI. In situ hybridization experiments inrat pain models show up-regulation of the 56638 gene one and seven daysafter axotomy and after complete Freund's adjuvant intraplantarinjection. These levels go back to normal at later time points. Nocontralateral effects were observed. These experiments indicate a rolefor the 56638 molecule in pain response.

[0450] Therefore, neprilysin and 56638 associated disorders candetrimentally affect regulation and modulation of the pain response; andvasoconstriction, inflammatory response and pain therefrom. Examples ofneprilysin associated disorders in which the 56638 molecules of theinvention may be directly or indirectly involved include pain, painsyndromes, and inflammatory disorders, including inflammatory pain.

[0451] As the 56638 polypeptides of the invention may modulate56638-mediated activities, they may be useful for developing noveldiagnostic and therapeutic agents for 56638-mediated or relateddisorders. For example, the 56638 molecules can act as novel diagnostictargets and therapeutic agents controlling pain, pain disorders, andinflammatory disorders. For example, a 56638 inhibitor can be useful inthe treatment of pain, as 56638 inhibition could increase the endogenouslevels of enkephalins and thereby increase the associated analgesicresponse.

[0452] The 56638 molecules can also act as novel diagnostic targets andtherapeutic agents controlling pain caused by other disorders, e.g.,cancer, e.g., prostate cancer. For example, endothelin, which isinactivated by neprilysin, is associated with the excruciating,debilitating pain that comes when prostate cancer invades the bone(reviewed in Nelson and Carducci (2000) BJU Int 85 Suppl 2:45-8). Inaddition, a neprolysin family member can be a marker of common acutelymphoblastic leukemia antigen present at the surface of B cells (Roqueset al. (1993) Pharmacol Rev 45:87). Accordingly, the 56638 molecules canact as novel diagnostic targets and therapeutic agents for controllingone or more of cellular proliferative and/or differentiative disorders,or pain therefrom.

[0453] The 56638 molecules can also act as novel diagnostic targets andtherapeutic agents for brain disorders.

[0454] In addition, a neprolysin family member can be a Kell blood groupantigen (Lee et al. (1999) Proc Natl Acad Sci USA 88:6353-6357). Kellantigens are highly immunogenic and may cause severe fetal anemia insensitized mothers, erythroblastosis in newborn infants, and severehemolytic reactions if mismatched blood is transfused. Therefore, the56638 molecules can also act as novel diagnostic targets and therapeuticagents controlling disorders related to hematopoietic cells, e.g., bloodcell- (e.g., erythroid-) associated disorders, e.g., anemia, orerythroblastosis.

[0455] The 56638 nucleic acid and protein of the invention can be usedto treat and/or diagnose a variety of immune disorders.

[0456] Identification and Characterization of Human 56638 cDNA

[0457] The human 56638 sequence (SEQ ID NO:57), which is approximately2953 nucleotides long, including untranslated regions, contains apredicted methionine-initiated coding sequence of about 2340nucleotides, including the termination codon (nucleotides indicated as“coding” of SEQ ID NO:57; SEQ ID NO:59). The coding sequence encodes a779 amino acid protein (SEQ ID NO:58).

[0458] Tissue Distribution of 56638 mRNA

[0459] Endogenous human 56638 gene expression was determined using thePerkin-Elmer/ABI 7700 Sequence Detection System which employs TaqMantechnology.

[0460] To determine the level of 56638 in various human tissues aprimer/probe set was designed using Primer Express (Perkin-Elmer)software and primary cDNA sequence information. Total RNA was preparedfrom a series of human tissues using an RNeasy kit from Qiagen. Firststrand cDNA was prepared from 1 μg total RNA using an oligo-dT primerand Superscript II reverse transcriptase (Gibco/BRL). cDNA obtained fromapproximately 50 ng total RNA was used per TaqMan reaction. 56638 mRNAlevels were analyzed in a variety of samples of human tissues, and inrodent models of pain response.

[0461] Relative 56638 mRNA expression was determined using mRNA derivedfrom human tissue samples, both normal, and tumor. The samples arederived from human adrenal gland, brain, heart, kidney, liver, lung,mammary gland, placenta, prostate, salivary gland, muscle, smallintestine, spleen, stomach, testes, thymus, trachea, uterus, spinalcord, skin, and dorsal root ganglion (DRG). The highest 56638 mRNAexpression was observed in spinal cord, DRG, small intestine, testes,and trachea.

[0462] TaqMan experiments in rat showed that 56638 is expressed inpituitary gland, spinal cord, brain, nerve, TRG and DRG. TaqManexperiments on rodent models of pain response showed that the 56638 geneis up-regulated in DRG 7 days after axotomy and in the CCI model ofneuropathic pain (7 days). No regulation was observed in the model ofinflammatory pain, and there was no regulation in rat spinal cord in anyof the models analyzed.

[0463] In situ hybridization experiments with the human 56638 probeshowed expression in monkey brain, a subpopulation of DRG neurons, inthe epithelium of trachea, and small intestine, as well as skin. In situhybridization in rat animal models show upregulation of the 56638 geneone and seven days after axotomy and after CFA intraplantar injection.These levels go back to normal at later time points. No contralateraleffects were observed.

[0464] Human 18610

[0465] The present invention is based, at least in part, on thediscovery of novel molecules, referred to herein as “transientreceptor”, “TR-1” or “18610” nucleic acid and polypeptide molecules,which are novel members of the transient receptor potential channelfamily. Transient receptor potential channel family members are ionchannels, e.g., calcium channels. These novel molecules are capable of,for example, modulating an ion-channel mediated activity (e.g., acalcium channel-mediated activity) in a cell, e.g., a neuronal, muscle(e.g., cardiac muscle), or liver cell.

[0466] Calcium signaling has been implicated in the regulation of avariety of cellular responses, such as growth and differentiation. Thereare two general methods by which intracellular concentrations of calciumions may be increased: calcium ions may be freed from intracellularstores, transported by specific membrane channels in the storageorganelle, or calcium ions may be brought into the cell from theextracellular milieu through the use of specific channels in thecellular membrane. In the situation in which the intracellular stores ofcalcium have been depleted, a specific type of calcium channel, termed a‘capacitative calcium channel’ or a ‘store-operated calcium channel’(SOC), is activated in the plasma membrane to import calcium ions fromthe extracellular environment to the cytosol (see Putney and McKay(1999) BioEssays 21:38-46). Calcium may also enter the cell viareceptor-stimulated cation channels (see Hofmann et al. (2000) J. Mol.Med. 78:14-25).

[0467] Members of the capacitative calcium channel family include thecalcium release-activated calcium current (CRAC) (Hoth and Penner (1992)Nature 355: 353-355), calcium release-activated non-selective cationcurrent (CRANC) (Krause et al. (1996) J. Biol. Chem. 271: 32523-32528),and the transient receptor potential (TRP) proteins TRP1, TRP2, TRP4,and TRP5. Depletion of intracellular calcium stores activate thesechannels by a mechanism which is yet undefined, but which has beendemonstrated to involve a diffusible factor using studies in whichcalcium stores were artificially depleted (e.g., by the introduction ofchelators into the cell, by activating phospholipase C_(y), or byinhibiting those enzymes responsible for pumping calcium ions into thestores or those enzymes responsible for maintaining restingintracellular calcium ion concentrations) (Putney, J. W. (1986) CellCalcium 7:1-12; Putney, J. W. (1990) Cell Calcium 11:611-624).

[0468] Recently, it has been elucidated that three TRP family members,TRP3, TRP6, and a mouse homologue, TRP7, form a sub-family of receptorsthat are activated in a calcium store-depletion independent manner. TRP3and TRP6 are activated by diacylglycerols in a membrane delimited manner(Hofmann et al. (1999) Nature 397:259263). Similarly, murine TRP7 isactivated via diacylglycerol stimulation by G_(q) protein coupledreceptors (Okada et al. (1999) J. Biol. Chem. 274:27359-27370).

[0469] The TRP channel family is one of the best characterized calciumchannel protein families. These channels include transient receptorpotential proteins and homologues thereof (to date, seven TRP homologuesand splice variants have been identified in a variety of organisms), thevanilloid receptor subtype I (also known as the capsaicin receptor); thestretch-inhibitable non-selective cation channel (SIC); the olfactory,mechanosensitive channel; the insulin-like growth factor I-regulatedcalcium channel; the vitamin D-responsive apical, epithelial calciumchannel (ECaC); and melastatin, and the polycystic kidney diseaseprotein family (see, e.g., Montell and Rubin (1989) Neuron 2:1313-1323;Caterina et al. (1997) Nature 389: 816-824; Suzuki et al. (1999) J.Biol. Chem. 274: 6330-6335; Kiselyov et al. (1998) Nature 396: 478-482;Hoenderop et al. (1999) J. Biol. Chem. 274: 8375-8378; and Chen et al.(1999) Nature 401(6751): 383-386). Each of these molecules is 700 ormore amino acids in length, and shares certain conserved structuralfeatures. Predominant among these structural features are sixtransmembrane domains, with an additional hydrophobic loop presentbetween the fifth and sixth transmembrane domains. It is believed thatthis loop is integral to the activity of the pore of the channel formedupon membrane insertion (Hardie and Minke (1993) Trends Neurosci 16:371-376). Although found in disparate tissues and organisms, members ofthe TRP channel protein family all serve to transduce signals by meansof calcium entry into cells, particularly pain signals (see, e.g.,McClesky and Gold (1999) Annu. Rev. Physiol. 61: 835-856; Harteneck, C.(2000) Trends Neurosci. 23(4): 159), light signals (Hardie and Minke,supra), or olfactory signals (Colbert et al. (1997) J. Neurosci 17(21):8259-8269). Thus, this family of molecules may play important roles insensory signal transduction in general.

[0470] As used herein, an “ion channel” includes a protein orpolypeptide which is involved in receiving, conducting, and transmittingsignals in an electrically excitable cell, e.g., a neuronal or musclecell. Ion channels include calcium channels, potassium channels, andsodium channels. As used herein, a “calcium channel” includes a proteinor polypeptide which is involved in receiving, conducting, andtransmitting calcium ion-based signals in an electrically excitablecell. Calcium channels are calcium ion selective, and can determinemembrane excitability (the ability of, for example, a neuronal cell torespond to a stimulus and to convert it into a sensory impulse). Calciumchannels can also influence the resting potential of membranes, waveforms and frequencies of action potentials, and thresholds ofexcitation. Calcium channels are typically expressed in electricallyexcitable cells, e.g., neuronal cells, and may form heteromultimericstructures (e.g., composed of more than one type of subunit). Calciumchannels may also be found in non-excitable cells (e.g., adipose cellsor liver cells), where they may play a role in, e.g., signaltransduction. Calcium channels are described in, for example, Davila etal. (1999) Annals New York Academy of Sciences 868:102-17 and McEnery,M. W. et al. (1998) J. Bioenergetics and Biomembranes 30(4): 409-418,the contents of which are incorporated herein by reference. As the TR-1molecules of the present invention are calcium channels modulating ionchannel mediated activities (e.g., calcium channel mediated activities),they may be useful for developing novel diagnostic and therapeuticagents for ion channel associated disorders (e.g., calcium channelassociated disorders).

[0471] As used herein, an “ion channel associated disorder” includes adisorder, disease or condition which is characterized by a misregulationof an ion channel mediated activity. For example, a “calcium channelassociated disorder” includes a disorder, disease or condition which ischaracterized by a misregulation of a calcium channel mediated activity.Ion channel associated disorders, e.g., calcium channel associateddisorders, include but are not limited to CNS disorders, pain disorders,cellular proliferation, growth, differentiation, or migration disorders.

[0472] As used herein, the term “pain signaling mechanisms” includes thecellular mechanisms involved in the development and regulation of pain,e.g., pain elicited by noxious chemical, mechanical, or thermal stimuli,in a subject, e.g., a mammal such as a human. In mammals, the initialdetection of noxious chemical, mechanical, or thermal stimuli, a processreferred to as “nociception”, occurs predominantly at the peripheralterminals of specialized, small diameter sensory neurons. These sensoryneurons transmit the information to the central nervous system, evokinga perception of pain or discomfort and initiating appropriate protectivereflexes. The TR-1 molecules of the present invention may be present onthese sensory neurons and, thus, may be involved in detecting thesenoxious chemical, mechanical, or thermal stimuli and transducing thisinformation into membrane depolarization events. Thus, the TR-1molecules by participating in pain signaling mechanisms, may modulatepain elicitation and act as targets for developing novel diagnostictargets and therapeutic agents to control pain.

[0473] As used herein, a “cellular proliferation, growth,differentiation, or migration process” is a process by which a cellincreases in number, size or content, by which a cell develops aspecialized set of characteristics which differ from that of othercells, or by which a cell moves closer to or further from a particularlocation or stimulus. The TR-1 molecules of the present invention areinvolved in signal transduction mechanisms, which are known to beinvolved in cellular growth, differentiation, and migration processes.Thus, the TR-1 molecules may modulate cellular growth, differentiation,or migration, and may play a role in disorders characterized byaberrantly regulated growth, differentiation, or migration. Suchdisorders include cancer, e.g., carcinoma, sarcoma, or leukemia; tumorangiogenesis and metastasis; skeletal dysplasia; neuronal deficienciesresulting from impaired neural induction and patterning; hepaticdisorders; cardiovascular disorders; and hematopoietic and/ormyeloproliferative disorders.

[0474] As used herein, an “ion channel mediated activity” includes anactivity which involves an ion channel, e.g., an ion channel in aneuronal cell, a muscular cell, or a liver cell, associated withreceiving, conducting, and transmitting signals, in, for example, thenervous system. Ion channel mediated activities (e.g., calcium channelmediated activities) include release of neurotransmitters or secondmessenger molecules (e.g., dopamine or norepinephrine), from cells,e.g., neuronal cells; modulation of resting potential of membranes, waveforms and frequencies of action potentials, and thresholds ofexcitation; participation in signal transduction pathways, andmodulation of processes such as integration of sub-threshold synapticresponses and the conductance of back-propagating action potentials in,for example, neuronal cells (e.g., changes in those action potentialsresulting in a morphological or differentiative response in the cell).

[0475] The family of TR-1 polypeptides comprise at least one“transmembrane domain” and preferably six transmembrane domains. As usedherein, the term “transmembrane domain” includes an amino acid sequenceof about 10-30 amino acid residues in length which spans the plasmamembrane. More preferably, a transmembrane domain includes about atleast 10, 15, 20, 25, or 30 amino acid residues and spans the plasmamembrane. Transmembrane domains are rich in hydrophobic residues, andtypically have an alpha-helical structure. In a preferred embodiment, atleast 50%, 60%, 70%, 80%, 90%, 95% or more of the amino acids of atransmembrane domain are hydrophobic, e.g., leucines, isoleucines,alanines, valines, phenylalanines, prolines or methionines.Transmembrane domains are described in, for example, Zagotta W. N. etal, (1996) Annual Rev. Neurosci. 19: 235-263, the contents of which areincorporated herein by reference. Amino acid residues 758-774, 856-876,923-941, 957-974, 1000-1016, and 1071-1096 of the 18610 or TR-1polypeptide comprise transmembrane domains. Accordingly, TR-1polypeptides having at least 50-60% homology, preferably about 60-70%,more preferably about 70-80%, or about 80-90% homology with atransmembrane domain of human TR-1 are within the scope of theinvention.

[0476] In another embodiment, a 18610 or TR-1-molecule of the presentinvention is identified based on the presence of at least one poredomain between the fifth and sixth transmembrane domains. As usedherein, the term “pore domain” includes an overall hydrophobic aminoacid sequence which is located between two transmembrane domains of acalcium channel protein, preferably transmembrane domains 5 and 6, andwhich is believed to be a major determinant of ion selectivity andchannel activity in calcium channels. Pore domains are described in, forexample Vannier et al. (1998) J. Biol. Chem. 273: 8675-8679 andPhillips, A. M. et al. (1992) Neuron 8, 631-642, the contents of whichare incorporated herein by reference. TR-1 molecules having at least onepore domain are within the scope of the invention. A pore domain isfound in the human TR-1 sequence (SEQ ID NO:64) at about residues1036-1055.

[0477] In another embodiment, a TR-1 molecule of the present inventionis identified based on the presence of at least one “transient receptordomain.” As used herein, the term “transient receptor domain” includes aprotein domain having an amino acid sequence of about 40-175 amino acidresidues which serves to transport ions. Preferably, a transientreceptor domain includes at least about 48 amino acid residues. Toidentify the presence of a transient receptor domain in a TR-1 protein,and make the determination that a protein of interest has a particularprofile, the amino acid sequence of the protein may be searched againsta database of known protein domains (e.g., the HMM database). Thetransient receptor domain (HMM) has been assigned the PFAM AccessionPF02164. A search was performed against the HMM database resulting inthe identification of three transient receptor domains in the amino acidsequence of human 18610 (SEQ ID NO:64) at about residues 699-747,849-1016, and 1079-1137 of SEQ ID NO:64.

[0478] A description of the Pfam database can be found in Sonhammer etal. (1997) Proteins 28:405-420 and a detailed description of HMMs can befound, for example, in Gribskov et al. (1990) Meth. Enzymol.183:146-159; Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531; andStultz et al. (1993) Protein Sci. 2:305-314, the contents of which areincorporated herein by reference.

[0479] In a preferred embodiment, the TR-1 molecules of the inventioninclude at least one transmembrane domain, preferably six transmembranedomains, at least one pore domain, and/or at least one transientreceptor domain.

[0480] Isolated polypeptides of the present invention, preferably 18610or TR-1 polypeptides, have an amino acid sequence sufficiently identicalto the amino acid sequence of SEQ ID NO:64 or are encoded by anucleotide sequence sufficiently identical to SEQ ID NO:63 or 65. Asused herein, the term “sufficiently identical” refers to a first aminoacid or nucleotide sequence which contains a sufficient or minimumnumber of identical or equivalent (e.g., an amino acid residue which hasa similar side chain) amino acid residues or nucleotides to a secondamino acid or nucleotide sequence such that the first and second aminoacid or nucleotide sequences share common structural domains or motifsand/or a common functional activity. For example, amino acid ornucleotide sequences which share common structural domains having atleast 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 91%, 92%, 93%,94%, 95%, 96%, 97%, 98%, 99% or more homology or identity across theamino acid sequences of the domains and contain at least one andpreferably two structural domains or motifs, are defined herein assufficiently identical. Furthermore, amino acid or nucleotide sequenceswhich share at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%,91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homology or identityand share a common functional activity are defined herein assufficiently identical.

[0481] In a preferred embodiment, a TR-1 polypeptide includes at leastone or more of the following domains: a transmembrane domain, and/or apore domain, and/or a transient receptor domain, and has an amino acidsequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%,90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more homologous oridentical to the amino acid sequence of SEQ ID NO:64, or the amino acidsequence encoded by the DNA insert of the plasmid deposited with ATCC asAccession Number ______. In yet another preferred embodiment, a TR-1polypeptide includes at least one or more of the following domains: atransmembrane domain, and/or a pore domain, and/or a transient receptordomain, and is encoded by a nucleic acid molecule having a nucleotidesequence which hybridizes under stringent hybridization conditions to acomplement of a nucleic acid molecule comprising the nucleotide sequenceof SEQ ID NO:63 or SEQ ID NO:65. In another preferred embodiment, a TR-1polypeptide includes at least one or more of the following domains: atransmembrane domain, and/or a pore domain, and/or a transient receptordomain, and has a 18610 or TR-1 activity.

[0482] As used interchangeably herein, a “TR-1 activity”, “biologicalactivity of TR-1” or “functional activity of TR-1”, refers to anactivity exerted by a TR-1 polypeptide or nucleic acid molecule on aTR-1 responsive cell or tissue, or on a TR-1 polypeptide substrate, asdetermined in vivo, or in vitro, according to standard techniques. Inone embodiment, a TR-1 activity is a direct activity, such as anassociation with a TR-1-target molecule. As used herein, a “substrate,”“target molecule,” or “binding partner” is a molecule with which a TR-1polypeptide binds or interacts in nature, such that TR-1-mediatedfunction is achieved. A TR-1 target molecule can be a non-TR-I moleculeor a TR-1 polypeptide or polypeptide of the present invention. In anexemplary embodiment, a TR-1 target molecule is a TR-1 ligand, e.g., acalcium channel ligand such as calcium. Alternatively, a TR-1 activityis an indirect activity, such as a cellular signaling activity mediatedby interaction of the TR-1 polypeptide with a TR-1 ligand. Thebiological activities of TR-1 are described herein. For example, theTR-1 polypeptides of the present invention can have one or more of thefollowing activities: (1) modulate membrane excitability, (2) influencethe resting potential of membranes, (3) modulate wave forms andfrequencies of action potentials, (4) modulate thresholds of excitation,(5) modulate neurite outgrowth and synaptogenesis, (6) modulate signaltransduction, (7) participate in nociception, and (8) bind and transportcalcium ions.

[0483] The nucleotide sequence of the isolated human TR-1 cDNA and thepredicted amino acid sequence of the human TR-1 polypeptide are shown inSEQ ID NOs:63 and 64, respectively. A plasmid containing the nucleotidesequence encoding human TR-1 was deposited with the American TypeCulture Collection (ATCC), 10801 University Boulevard, Manassas, Va.20110-2209, on ______ and assigned Accession Number ______. This depositwill be maintained under the terms of the Budapest Treaty on theInternational Recognition of the Deposit of Microorganisms for thePurposes of Patent Procedure. This deposit was made merely as aconvenience for those of skill in the art and is not an admission that adeposit is required under 35 U.S.C. §112.

[0484] Isolation of the Human 18610 or TR-1 cDNA

[0485] The invention is based, at least in part, on the discovery of ahuman gene encoding a novel polypeptide, referred to herein as eitherhuman 18610 or TR-1. The entire sequence of the human clone Fbh18610 wasdetermined and found to contain an open reading frame termed human“18610” or “TR-1.” The nucleotide sequence of the human 18610 gene,which is 7334 nucleotides in length, is set forth in the SequenceListing as SEQ ID NO:63. The amino acid sequence of the human 18610expression product is set forth in the Sequence Listing as SEQ ID NO:64.The 18610 polypeptide comprises about 1885 amino acids. The codingregion (open reading frame) of SEQ ID NO:63 is set forth as SEQ IDNO:65. Clone Fbh18610FL, comprising the coding region of human 18610,was deposited with the American Type Culture Collection (ATCC®), 10801University. Boulevard, Manassas, Va. 20110-2209, on ______, and assignedAccession No. ______.

[0486] Analysis of the Human 18610 or TR-1 Molecules

[0487] A search using the polypeptide sequence of SEQ ID NO:64 wasperformed against the HMM database in PFAM resulting in theidentification of three potential transient receptor domains in theamino acid sequence of human TR-1 at about residues 699-747, 849-1016,and 1079-1137 of SEQ ID NO:64. A search also identified an ion transportprotein domain in the amino acid sequence of human TR-1 (SEQ ID NO:64)at about amino acid residues 884-1096 and an AN1-like zinc finger domainat about residues 33-61 of SEQ ID NO:64.

[0488] The amino acid sequence of human TR-1 was analyzed using theprogram PSORT to predict the localization of the proteins within thecell. This program assesses the presence of different targeting andlocalization amino acid sequences within the query sequence. The resultsof the analyses show the likelihood of human 18610 or TR-1 (SEQ IDNO:64) being localized, for example, to the endoplasmic reticulum, thenucleus, and the plasma membrane.

[0489] A MEMSAT analysis of the polypeptide sequence of SEQ ID NO:64 wasalso performed, predicting eight potential transmembrane domains in theamino acid sequence of human 18610 or TR-1 (SEQ ID NO:64) at aboutresidues 282-301, 507-524, 758-774, 856-876, 923-941, 957-974,1000-1016, and 1127-1146 of SEQ ID NO:64. However, a structural,hydrophobicity, and antigenicity analysis resulted in the identificationof six transmembrane domains (TM1-TM6) and one pore domain betweentransmembrane domains five and six. TM1 is at about residues 758-774 ofSEQ ID NO:64, TM2 is at about residues 856-876 of SEQ ID NO:64, TM3 isat about residues 923-941 of SEQ ID NO:64, TM4 is at about residues957-974 of SEQ ID NO:64, TM5 is at about residues 1000-1016 of SEQ IDNO:64, TM6 is at about residues 1071-1096 of SEQ ID NO:64, and the poredomain is at about residues 1036-1055 of the amino acid sequence setforth as SEQ ID NO:64.

[0490] Searches of the amino acid sequence of human 18610 were furtherperformed against the Prosite database. These searches resulted in theidentification in the amino acid sequence of human 18610 (SEQ ID NO:64)of a number of potential N-glycosylation sites at about residues404-407, 550-553, 715-718, 805-808, 925-928, 1058-1061, 1485-1488,1616-1619, 1794-1797, and 1870-1873 of SEQ ID NO:64, a number ofpotential cAMP and cGMP-dependent protein kinase phosphorylation sitesat about residues 600-603, 754-757, 1493-1496, and 1521-1524 of SEQ IDNO:64, a number of potential kinase C phosphorylation sites at aboutresidues 2-4, 12-14, 22-24, 103-105, 195-197, 318-320, 349-351, 523-525,529-531, 547-549, 615-617, 697-699, 727-729, 836-838, 842-844,1245-1247, 1410-1412, 1456-1458, 1491-1493, 1520-1522, 1547-1549,1719-1721, 1871-1873, and 1880-1882 of SEQ ID NO:64, a number ofpotential casein kinase II phosphorylation sites at about residues 5-8,12-15, 22-25, 87-90, 115-118, 299-302, 367-370, 406-409, 508-511,593-596, 603-606, 675-678, 778-781, 795-798, 883-886, 1163-1166,1191-1194, 1361-1364, 1413-1416, 1430-1433, 1524-1527, 1547-1550,1576-1579, 1635-1638, 1652-1655, 1763-1766, 1779-1782, and 1871-1874 ofSEQ ID NO:64, a number of potential tyrosine kinase phosphorylationsites at about residues 320-327, 1212-1220, and 1566-1574 of SEQ IDNO:64, a number of potential N-myristoylation sites at about residues32-37, 99-104, 159-164, 174-179, 208-213, 317-322, 357-362, 402-407,522-527, 940-945, 1293-1298, 1349-1354, 1385-1390, 1438-1443, 1556-1561,1642-1647, 1734-1739, and 1790-1795 of SEQ ID NO:64, and an amidationsite at about residues 597-600 of SEQ ID NO:64.

[0491] A search of the amino acid sequence of human 18610 (SEQ ID NO:64)was also performed against the ProDom database. The results of thissearch identified numerous matches against protein domains described as,for example, “receptor from F54D1.5 transient sequence,” “melastatin FISchromosome receptor MTR1 transmembrane,” “melastatin receptor chromosometransmembrane transient potential related,” “melastatin FIS receptorMTR1 transmembrane chromosome,” “receptor channel potential transientNOMPC TRP2 2-beta 2-alpha,” “receptor transient potential-related,”“channel receptor calcium transient potential repeat vanilloidtransmembrane ion transport,” “kinase serine/threonine-protein,ATP-binding transferase,” “kinase elongation serine/threonine-proteintransferase factor-2 eukaryotic calcium/calmodulin-dependent repeat,”“kinase receptor-like,” and the like were identified.

[0492] Tissue Distribution of Human 18610 or TR-1 mRNA by PCR Analysis

[0493] The following describes the tissue distribution of human 18610mRNA, as may be determined by Polymerase Chain Reaction (PCR) on cDNAlibraries using oligonucleotide primers based on the human 18610sequence. For in situ analysis, various tissues, e.g. tissues obtainedfrom brain, are first frozen on dry ice.

[0494] Tissue Distribution of Human 18610 or TR-1 mRNA by TaqMan™analysis

[0495] This example describes the tissue distribution of human 18610mRNA in a variety of cells and tissues, as determined using the TaqMan™procedure.

[0496] A human tissue panel was tested revealing highest expression ofhuman 18610 mRNA in the in Jurkat cells (T-cell leukemia cells) and K562cells (chronic myeloid leukemia cells), indicating a role for 18610 incellular proliferation, growth, differentiation, or migration disorderssuch as cancer.

[0497] Human 33217

[0498] The invention is based, at least in part, on the identificationof a novel AMP binding enzyme, referred to herein as “33217”. The human33217 sequence (see SEQ ID NO:66), which is approximately 2846nucleotides long including untranslated regions, contains a predictedmethionine-initiated coding sequence of about 2058 nucleotides,including the termination codon (see SEQ ID NO:68). The coding sequenceencodes a 685 amino acid protein (see SEQ ID NO:67).

[0499] Human 33217 contains the following regions or other structuralfeatures: an AMP-binding enzyme domain (PFAM Accession Number PF00501)located at about amino acid residues 144-585 of SEQ ID NO:67, whichincludes a predicted AMP-binding domain signature (PS00455) at aboutamino acids 295 to 306 of SEQ ID NO:67; two predicted N-glycosylationsites (PS00001) from about amino acids 359-362 and 608-611 of SEQ IDNO:67; one predicted glycosaminoglycan attachment site (PS00002) fromabout amino acids 56-59 of SEQ ID NO:67; one predictedcAMP/cGMP-dependent protein kinase phosphorylation site (PS00004)located at about amino acids 9-12 of SEQ ID NO:67; four predictedProtein Kinase C phosphorylation sites (PS00005) at about amino acids101-103, 144-146, 207-209, and 646-648 of SEQ ID NO:67; seven predictedCasein Kinase II phosphorylation sites (PS00006) located at about aminoacids 58-61, 69-72, 144-147, 208-211, 552-555, 579-582, and 667-670 ofSEQ ID NO:67; fourteen predicted N-myristylation sites (PS00008) fromabout amino acids 23-28, 29-34, 44-49, 163-168, 191-196, 199-204,224-229, 303-308, 328-333, 370-375, 405-410, 453-458, 462-467, and510-515 of SEQ ID NO:67; and one predicted amidation site (PS00009) fromabout amino acids 227-230 of SEQ ID NO:67.

[0500] Polypeptides of the invention include fragments which include:all or part of a hydrophobic sequence, e.g., the sequence from aboutamino acid 30 to 40, from about 185 to 200, and from about 385 to 395 ofSEQ ID NO:67; all or part of a hydrophilic sequence, e.g., the sequenceof from about amino acid 85 to 100, from about 270 to 280, and fromabout 465 to 475 of SEQ ID NO:67.

[0501] For general information regarding PFAM identifiers, PS prefix andPF prefix domain identification numbers, refer to Sonnhammer et al.(1997) Protein 28:405-420.

[0502] A plasmid containing the nucleotide sequence encoding human 33217(clone “Fbh33217FL”) was deposited with American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on andassigned Accession Number ______. This deposit will be maintained underthe terms of the Budapest Treaty on the International Recognition of theDeposit of Microorganisms for the Purposes of Patent Procedure. Thisdeposit was made merely as a convenience for those of skill in the artand is not an admission that a deposit is required under 35 U.S.C. §112.

[0503] The 33217 protein contains a significant number of structuralcharacteristics in common with members of the AMP-binding enzyme family.

[0504] Acetyl-Coenzyme A (Ac-CoA) is an activated form of acetate thatis involved in lipid biosynthesis, energy metabolism, and other normalprocesses in human cells. Ac-CoA can be generated by catabolism ofglucose (e.g., through operation of the Krebs cycle) or fatty acids.

[0505] Ac-CoA is a starting material used in biosynthesis ofcholesterol, fatty acids, lipids, and biochemical products derived fromthese (e.g., sterol and other hormones). Ac-CoA is made by ligation ofan adenylate moiety (derived by cleaving a pyrophosphonate moiety fromATP) with the acetyl carboxyl group, and then by substituting a CoAmoiety in place of the adenylate moiety. Overall, the net reaction is:acetate+CoASH+ATP→Ac-CoA+AMP+PP_(i). This reaction is catalyzed by anenzyme designated acetyl-CoA synthetase (ACS; EC 6.2.1.1; sometimesdesignated acetate-CoA ligase, acetate thiokinase, or acetyl-activatingenzyme).

[0506] ACS enzymes are involved in lipid synthesis and energygeneration. A cytosolic form of human ACS has been cloned, and an invitro enzymatic assay of ACS activity has been described (Luong et al.(2000) J. Biol. Chem. 275:26458-26466). In yeast and bacteria,expression of ACS can be induced or enhanced by one or more of adecrease in oxygen partial pressure, an increase in intracellular cAMPconcentration, and increased carbon flux through acetate-associatedmetabolic pathways (Kratzer et al. (1997) Mol. Microbiol. 26:631-641;Hiesinger et al. (1997) FEBS Lett. 415:16-20; Kumari et al. (2000) J.Bacteriol. 182:4173-4179). ACS is also up-regulated in developing plantseeds (Ke et al. (2000) Plant Physiol. 123:497-508).

[0507] The AMP-binding enzyme family of proteins is characterized by acommon domain, an “AMP-binding enzyme domain,” that permits therespective family members to act via and ATP-dependent covalent bindingof AMP to their substrates.

[0508] A 33217 polypeptide can include a “AMP-binding enzyme domain” orregions homologous with a “AMP-binding enzyme domain.”

[0509] As used herein, the term “AMP-binding enzyme domain” includes anamino acid sequence of about 250 to 600 amino acid residues in lengthand having a bit score for the alignment of the sequence to theAMP-binding enzyme domain profile (Pfam HMM) of at least 100.Preferably, a AMP-binding enzyme domain includes at least about 350 to500 amino acids, more preferably about 400 to 475 amino acid residues,or about 430 to 450 amino acids and has a bit score for the alignment ofthe sequence to the AMP-binding enzyme domain (HMM) of at least 130,150, 190 or greater. The AMP-binding enzyme domain (HMM) has beenassigned the PFAM Accession Number PF00501. Preferably, a 33217polypeptide includes an AMP-binding domain signature having theconsensus sequence[LIVMFY]-x(2)-[STG]-[STAG]-G-[ST]-[STEI]-[SG]-x-[PASLIVM]-[KR] (SEQ IDNO:70). Preferably, a 33217 polypeptide contains the AMP-binding domainsignature located at amino acids 295-306 of SEQ ID NO:67.

[0510] In a preferred embodiment 33217 polypeptide or protein has a“AMP-binding enzyme domain” or a region which includes at least about350 to 500 more preferably about 400 to 475 or 430 to 450 amino acidresidues and has at least about 50%, 60%, 70% 80% 90% 95%, 99%, or 100%homology with a “AMP-binding enzyme domain,” e.g., the AMP-bindingenzyme domain of human 33217 (e.g., residues 144 to 585 of SEQ IDNO:67).

[0511] To identify the presence of a “AMP-binding enzyme” domain in a33217 protein sequence, and make the determination that a polypeptide orprotein of interest has a particular profile, the amino acid sequence ofthe protein can be searched against the Pfam database of HMMs (e.g., thePfam database, release 2.1) using the default parameters. For example,the hmmsf program, which is available as part of the HMMER package ofsearch programs, is a family specific default program for MILPAT0063 anda score of 15 is the default threshold score for determining a hit.Alternatively, the threshold score for determining a hit can be lowered(e.g., to 8 bits). A description of the Pfam database can be found inSonhammer et al. (1997) Proteins 28(3):405-420 and a detaileddescription of HMMs can be found, for example, in Gribskov et al. (1990)Meth. Enzymol. 183:146-159; Gribskov et al. (1987) Proc. Natl. Acad.Sci. USA 84:4355-4358; Krogh et al. (1994) J. Mol. Biol. 235:1501-1531;and Stultz et al. (1993) Protein Sci. 2:305-314, the contents of whichare incorporated herein by reference. A search was performed against theHMM database resulting in the identification of a “AMP-binding enzyme”domain in the amino acid sequence of human 33217 at about residues 144to 585 of SEQ ID NO:67. The identified AMP-binding enzyme domain isdepicted in SEQ ID NO:69.

[0512] Human 33217 is predicted to be an acetyl-CoA synthetase enzyme(i.e., an acetyl-CoA ligase). Amino acid residues 205-404 of SEQ IDNO:67 align with amino acid residues 1034-1633 of a Pseudomonasaeruginosa acetyl-CoA synthetase (GENBANK™ Accession number AAG06956)with 58% sequence identity (117/200). The BLAST score for this alignmentis 642 (297.1 bits). In addition, amino acid residues 412-623 of SEQ IDNO:67 align with amino acid residues 1658-2293 of the Pseudomonasaeruginosa enzyme.

[0513] Amino acid residues 75-420 of SEQ ID NO:67 align with amino acidresidues 617-1654 of a Tetrahymena pyriformis acetyl-CoA synthetase(GENBANK™ Accession number BAA86907) with 47% sequence identity(163/346). The BLAST score for this alignment is 864 (398.8 bits). Inaddition, amino acid residues 438-554 of SEQ ID NO:67 align with aminoacid residues 1706-2056 of the Tetrahymena pyriformis enzyme, and aminoacid residues 567-612 of SEQ ID NO:67 align with amino acid residues2090-2227 of the Tetrahymena pyriformis enzyme.

[0514] A 33217 family member can include an AMP-binding enzyme domainand at least one AMP-binding domain signature. Furthermore, a 33217family member can include at least one, preferably two predictedN-glycosylation sites (PS00001); at least one predictedglycosaminoglycan attachment site (PS00002); at least one predictedcAMP/cGMP-dependent protein kinase phosphorylation site (PS00004); atleast one, two, three, and preferably four predicted protein kinase Cphosphorylation sites (PS00005); at least one, two, three, four, five,six, and preferably seven predicted casein kinase II phosphorylationsites (PS00006); and at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,and preferably 14 predicted N-myristylation sites (PS00008); and atleast one predicted amidation site (PS00009).

[0515] As the 33217 polypeptides of the invention may modulate33217-mediated activities, they may be useful as of for developing noveldiagnostic and therapeutic agents for 33217-mediated or relateddisorders, as described below.

[0516] As used herein, a “33217 activity”, “biological activity of33217” or “functional activity of 33217”, refers to an activity exertedby a 33217 protein, polypeptide or nucleic acid molecule. For example, a33217 activity can be an activity exerted by 33217 in a physiologicalmilieu on, e.g., a 33217-responsive cell or on a 33217 substrate, e.g.,a protein substrate. A 33217 activity can be determined in vivo or invitro. In one embodiment, a 33217 activity is a activity is a directactivity, such as acetyl-CoA ligase activity, e.g., acetyl-CoAsynthetase activity (i.e., ligation of a CoA moiety with an acetatemoiety coupled with removal of a pyrophosphate moiety from ATP;formation of acetyl-CoA from acetate and CoASH catalyzed by a 33217protein, proceeding through formation of an acetyl-adenylateintermediate). A “target molecule” or “binding partner” is a moleculewith which a 33217 protein binds or interacts in nature, e.g., anintegral membrane protein. In an exemplary embodiment, 33217 is anenzyme that acts via an ATP-dependent binding of AMP to its substrate.

[0517] A 33217 activity can also be an indirect activity, e.g., acellular signaling activity mediated by interaction of the 33217 proteinwith a 33217 receptor. The features of the 33217 molecules of thepresent invention can provide similar biological activities asAMP-binding enzyme family members. For example, the 33217 proteins ofthe present invention can have one or more of the following activities:(1) acetyl-CoA ligase activity; (2) promotion of activation of acetate;(3) promotion of acetate utilization (4) enhancement of uptake ofacetate into fatty acids and biochemical products made from fatty acids(e.g., lipids and hormones such as sterol hormones); (5) promotingangiogenesis; (6) enhancing or inducing expression of genes involved inangiogenesis; (7) enhancing tumor growth; (8) enhancing tumor cellsurvival; (9) inducing neo-angiogenesis; (10) inducing aberrantangiogenesis; (11) inducing tumorigenesis; (12) enhancing tumor cellmetastasis; (13) enhancing tumor cell invasivity; or (14) agonizing orantagonizing one or more of (1)-(13).

[0518] The 33217 polypeptide is predicted to be a soluble protein thatdisplays enzymatic activity. The 33217 polypeptide is likely to belocalized in the cytosol of human cells, although it can also belocalized within mitochondria. 33217 is expressed in several types oftumor cells and is expressed at a significantly lower level (or is notexpressed) in the corresponding normal tissue. For example, 33217 isexpressed in many tumor cells of glioblastomas (i.e., a type of braintumor), but is expressed at a significantly lower level in normal braincells. Similarly, 33217 is expressed in tumor cells of papillary serousovarian tumors, but is expressed at a significantly lower level innormal ovarian cells. 33217 is also expressed in tumor cells of smallcell lung tumors, but is expressed at a significantly lower level innormal lung cells and, apparently, in lung tumor cells of other types(e.g., non-small cell lung tumor cells).

[0519] Expression of 33217 correlates with expression of angiogenicfactors, including VEGF, IL-8, Id3, and HIF-1a (as described below).Co-regulation of 33217 and known angiogenic factors is an indicationthat 33217 is among the proteins involved in promoting angiogenesis.Up-regulation of 33217 in tumor cells is an indication that this proteinis involved in angiogenesis associated with tumor growth and survival.Involvement of other ACS enzymes in cell cycling, metabolic carbon flux,and seed development in nonhuman organisms suggests that 33217 has arole in shifting the metabolism of normal cells to adjust to alteredgrowth conditions (e.g., hypoxia, metabolic changes associated with oneor more of tumorigenesis, tumor growth, tumor invasion of surroundingtissues, and metastasis). Together, these observations indicate that33217 has a role in survival, growth, invasiveness, and metastasis oftumor cells. Modulation (e.g., decrease or increase) of 33217 expressioncan therefore modulate these disease processes, indicating therapeutic,diagnostic, prognostic, and preventive utility for the nucleic acids,polypeptides, and other 33217-associated molecules described in thisdisclosure.

[0520] The 33217 enzymatic activity is predicted to include acetyl-CoAligase activity, i.e., formation of acetyl-CoA thioesters, which can beused for lipid biosynthesis (and biosynthesis of biochemicals made fromfatty acids and lipids, such as cholesterol and hormones like the sterolhormones) or oxidized and used as a cellular energy source. Inparticular, 33217 is predicted to display acetyl-CoA synthetaseactivity.

[0521] Based on the above-described sequence similarities and functionalcharacterizations, the 33217 molecules of the present invention arepredicted to have similar biological activities as AMP-binding enzymefamily members. Thus, the 33217 molecules can act as novel diagnostictargets and therapeutic agents for fatty acid metabolism disorders andfor cellular proliferative and/or differentiative disorders.

[0522] Disorders which may be treated or diagnosed by methods describedherein include, but are not limited to, adrenoleukodystrophy,hypocholesterolemia, hypercholesterolemia, and disorders associated withan accumulation in the liver of fibrous tissue, such as that resultingfrom an imbalance between production and degradation of theextracellular matrix accompanied by the collapse and condensation ofpreexisting fibers. The methods described herein can be used to diagnoseor treat hepatocellular necrosis or injury induced by a wide variety ofagents including processes which disturb homeostasis, such as aninflammatory process, tissue damage resulting from toxic injury oraltered hepatic blood flow, and infections (e.g., bacterial, viral andparasitic). For example, the methods can be used for the early detectionof hepatic injury, such as portal hypertension or hepatic fibrosis. Inaddition, the methods can be employed to detect liver fibrosisattributed to inborn errors of metabolism, for example, fibrosisresulting from a storage disorder such as Gaucher's disease (lipidabnormalities) or a glycogen storage disease, A1-antitrypsin deficiency;a disorder mediating the accumulation (e.g., storage) of an exogenoussubstance, for example, hemochromatosis (iron-overload syndrome) andcopper storage diseases (Wilson's disease), disorders resulting in theaccumulation of a toxic metabolite (e.g., tyrosinemia, fructosemia andgalactosemia) and peroxisomal disorders (e.g., Zellweger syndrome).Additionally, the methods described herein may be useful for the earlydetection and treatment of liver injury associated with theadministration of various chemicals or drugs, such as for example,methotrexate, isonizaid, oxyphenisatin, methyldopa, chlorpromazine,tolbutamide or alcohol, or which represents a hepatic manifestation of avascular disorder such as obstruction of either the intrahepatic orextrahepatic bile flow or an alteration in hepatic circulationresulting, for example, from chronic heart failure, veno-occlusivedisease, portal vein thrombosis or Budd-Chiari syndrome.

[0523] Expression of 33217 was also detected in normal kidney, Wilm'stumor, uterine adenocarcinoma, fetal adrenal (very low), fetal kidney,fetal heart, normal heart, spinal cord, and lymphangioma tissues.Accordingly, 33217 nucleic acid sequences and fragments thereof,proteins encoded by these sequences and fragments thereof, as well asmodulators of 33217 gene or protein activity can be useful in diagnosingor treating diseases that involve these tissues in which the 33217 isexpressed.

[0524] Identification and Characterization of Human 33217 cDNA

[0525] The human 33217 sequence (SEQ ID NO:66) is approximately 2846nucleotides long. The region between and inclusive of the initiationcodon and the termination codon is a methionine-initiated codingsequence of about 2058 nucleotides, including the termination codon(nucleotides indicated as “coding” of SEQ ID NO:66; SEQ ID NO:68). Thecoding sequence encodes a 685 amino acid protein (SEQ ID NO:67).

[0526] Tissue Distribution of 33217 mRNA by TaqMan Analysis

[0527] Endogenous human 33217 gene expression was determined using thePerkin-Elmer/ABI 7700 Sequence Detection System which employs TaqMantechnology.

[0528] To determine the level of 33217 in various human tissues aprimer/probe set was designed. Total RNA was prepared from a series ofcell lines or human tissues using an RNeasy kit from Qiagen. Firststrand cDNA was prepared from 1 μg total RNA using an oligo-dT primerand Superscript II reverse transcriptase (Gibco/BRL). cDNA obtained fromapproximately 50 ng total RNA was used per TaqMan reaction. Tissuestested include the human tissues and cell lines shown in Tables 42, 43,and 44.

[0529] As shown in Tables 42 and 43, expression of 33217 correlates withexpression of angiogenic factors, including VEGF, IL-8, Id3, and HIF-1a.Co-regulation of 33217 and angiogenic factors is an indication that33217 participates in angiogenic processes. TABLE 42 Co-Regulation ofExpression of 33217 and Angiogenic Factors In Normal Brain andGlioblastoma Samples Relative Expression in Relative Expression in GeneNormal Brains Glioblastomas 33217 1.0 2.5 IL-8 1.0 3.3 Id3 1.0 3.4HIF-1a 1.0 5.7

[0530] TABLE 43 Co-Regulation of 33217 and VEGF-C In Normal Brain andGlioblastoma Samples Type of Brain Relative Tissue Sample RelativeExpression Expression of Sample Designation of 33217 VEGF-C Normal BrainMCL03 1.00 1.00 Normal Brain MCL04 1.33 1.47 Normal Brain MCL06 2.263.69 Glioblastoma CHT201 2.27 3.10 Glioblastoma CHT216 2.40 3.16Glioblastoma CHT501 3.39 4.90

[0531] As shown in Table 44, expression of 33217 is highly elevated insome lung tumor samples, as compared to normal lung tissue samples.TABLE 44 Expression of 33217 in Normal Lung and Lung Tumors Type of LungTissue Sample Relative Expression of 33217 Normal 0.7 Normal 0.7 Normal1.0 Normal 0.3 Tumor 0.2 Tumor 11.4 Tumor 0.8 Tumor 0.4 Tumor 10.6 Tumor0.2 Tumor 1.1

[0532] Human 21967

[0533] The present invention is based, at least in part, on thediscovery of novel molecules, referred to herein as Lysyl OxidaseRelated-2 (“Lor-2”) molecules, “Lor-2” or “21967” nucleic acid andpolypeptide molecules, which play a role in or function in a variety ofcellular processes in the cardiovascular system, e.g., cardiac cellfunction. In another embodiment, the Lor-2 molecules of the presentinvention modulate the activity of one or more proteins involved in acardiovascular disorder, e.g., congestive heart failure, ischemia,cardiac hypertrophy, ischemic-reperfusion injury.

[0534] As used herein, the term “cardiovascular disorder” includes adisease, disorder, or state involving the cardiovascular system, e.g.,the heart, the blood vessels, and/or the blood. A cardiovasculardisorder can be caused by an imbalance in arterial pressure, amalfunction of the heart, or an occlusion of a blood vessel, e.g., by athrombus. Examples of such disorders include hypertension,atherosclerosis, coronary artery spasm, coronary artery disease,valvular disease, arrhythmias, and cardiomyopathies.

[0535] As used herein, the term “congestive heart failure” includes acondition characterized by a diminished capacity of the heart to supplythe oxygen demands of the body. Symptoms and signs of congestive heartfailure include diminished blood flow to the various tissues of thebody, accumulation of excess blood in the various organs, e.g., when theheart is unable to pump out the blood returned to it by the great veins,exertional dyspnea, fatigue, and/or peripheral edema, e.g., peripheraledema resulting from left ventricular dysfunction. Congestive heartfailure may be acute or chronic. The manifestation of congestive heartfailure usually occurs secondary to a variety of cardiac or systemicdisorders that share a temporal or permanent loss of cardiac function.Examples of such disorders include hypertension, coronary arterydisease, valvular disease, and cardiomyopathies, e.g., hypertrophic,dilative, or restrictive cardiomyopathies. Congestive heart failure isdescribed in, for example, Cohn J. N. et al. (1998) American FamilyPhysician 57:1901-04, the contents of which are incorporated herein byreference.

[0536] As used herein, the term “cardiac cellular processes” includesintra-cellular or inter-cellular processes involved in the functioningof the heart. Cellular processes involved in the nutrition andmaintenance of the heart, the development of the heart, or the abilityof the heart to pump blood to the rest of the body are intended to becovered by this term. Such processes include, for example, cardiacmuscle contraction, distribution and transmission of electricalimpulses, and cellular processes involved in the opening and closing ofthe cardiac valves. The term “cardiac cellular processes” furtherincludes processes such as the transcription, translation andpost-translational modification of proteins involved in the functioningof the heart, e.g., myofilament specific proteins, such as troponin I,troponin T, myosin light chain 1 (MLC1), and α-actinin.

[0537] Lysyl oxidase (“LOX”) is an extracellular copper enzyme thatinitiates the crosslinking of collagens and elastin by catalyzingoxidative deamination of the E-amino group in certain lysine andhydroxylysine residues of collagens and lysine residues of elastin(Smith-Mungo and Kagan (1998) Matrix Biol. 16:387-398 and Kaman inBiology of Extracellular Matrix, ed. Mecham (1986) Academic Press pp.321-389). Lysyl oxidase has been shown to be important in a variety ofcellular and physiologic processes including biogenesis of connectivetissue matrices and bone resorption. A deficiency in lysyl oxidaseactivity is found in two X-linked, recessively inherited connectivetissue disorders, the type IX variant of the Ehlers-Danlos syndrome andthe Menkes syndrome, and in the X-linked, recessively inherited mottledseries of allelic mutant mice (all characterized by abnormalities incopper metabolism). (Byers et al. (1980) New Engl. J. Med. 303:61-65;Royce et al. (1980) Biochemistry J. 192:579-586; Kuivaniemi et al.(1982) J. Clin. Invest. 69:730-733; Kuivaniemi et al. (1985) Amer. J.Human. Genet. 37:798-808; Peltonen et al. (1983) Biochemistry22:6156-6163; Rowe et al. (1977) J. Biol. Chem. 252:939-942; Starcher etal. (1977) Biochem. Biophys. Res. Commun. 78:706-712; Danks in TheMetabolic Basis of Inherited Disease”, eds. Stanbury et al. (1983),McGraw-Hill pp. 1251-1268). Increased lysyl oxidase activity has beenassociated with fibrotic disorders such as atherosclerosis,hypertension, and liver and pulmonary fibrosis. (Kagan, supra).

[0538] More recently there have been identified proteins havingstructural and/or functional similarities to lysyl oxidase. For example,a lysyl oxidase-like protein, referred to herein as “LOL”, wasidentified from a human skin fibroblast cDNA library that containsextensive homology to several coding domains within the human lysyloxidase mRNA which is believed to be involved in collagen maturation.(Kenyon et al. (1993) J. Biol. Chem. 268:18435-18437 and Kim et al.(1995) J. Biol. Chem. 270:7176-7182). Recent cloning and analysis of themouse LOL gene (Kim et al. (1999) J. Cell Biochem. 72:181-188)demonstrated that steady state levels of LOL mRNA and type IIIprocollagen mRNA increased coincidentally early in the development ofliver fibrosis. In contrast, steady state levels of lysyl oxidase mRNAincreased throughout the onset of hepatic fibrosis and appeared inparallel with the increased steady state levels of pro-alpha (I)collagen mRNA, suggesting that the LOL protein is involved in thedevelopment of lysine-derived cross-links in collagenous substrates.Moreover, the substrate specificity of the LOL protein may be differentto that of lysyl oxidase and this difference may be collagen-typespecific.

[0539] Likewise, a protein referred to herein as lysyl-oxidase relatedprotein (“Lor”) has been identified which inhibits many of thestructural features of lysyl oxidase and is overexpressed in senescentfibroblasts and is believed to play a role in age-associated changes inextracellular proteins. (Saito et al. (1997) J. Biol. Chem.272:8157-8160). Lor contains four domains referred to herein asscavenger receptor cysteine-rich domains (“SRCR domains”) which arebelieved to be involved in binding to other cell surface proteins orextracellular molecules. The SRCR domain joins a long list of otherwidely distributed cysteine-containing domains found in extracellularportions of membrane proteins and in secreted proteins (Doolittle (1985)Trends Biochem. Sci. 10:233-237; Krieger in Molecular Structures ofReceptors, eds. Rossow et al. (1986) Horwood, Chichester, U.K. pp.210-231). Examples include the EGF-like domain, immunoglobulinsuperfamily domains, the LDL receptor/complement. C9 domain, clottingfactor Kringle domains, and fibronectin domains. These disulfidecross-linked domains appear to provide stable core structures that (i)are able to withstand the rigors of the extracellular environment; (ii)are well suited for a variety of biochemical tasks, often involvingbinding; and (iii) are readily juxtaposed to other types of domains topermit the construction of complex mosaic proteins. (Doolittle supra;Sudhof et al. (1985) Science 228:815-822).

[0540] Lysyl oxidases (“LOXs”) have been immunolocalized to theextracellular matrix regions of stroma surrounding early breast cancers(Decitre et al. (1998) Lab Invest. 78:143-151), with decreasedexpression observed in the stroma surrounding invasive breast cancers(Peyrol et al. (1997) Am. J. Pathol. 150:497-507). A progressive loss ofLOX expression has also been observed during prostrate cancerprogression in mice (Ren et al. (1998) Cancer Res. 58:1285-1290). Theseobservations suggest that lysyl oxidases may function as tumorsuppressors.

[0541] It has further been shown that human Lor is highly expressed inall adherent tumor cell lines examined, but not in cell lines that growin suspension (Saito et al., supra), suggesting that LOXs can increasethe adhesion properties of tumor cells. Lor expression was demonstratedto be concomitant with upregulation of type I procollagen. As adhesionproperties contribute to the ability of tumor cells to colonize newsites, a tumor-promoting role for LOXs is also probable.

[0542] One embodiment of the invention features Lor-2 nucleic acidmolecules, preferably human Lor-2 molecules, which were identified froma cDNA library made from the heart of a patient with congestive heartfailure (CHF). The Lor-2 nucleic acid and protein molecules of theinvention are described in further detail in the following subsections.

[0543] In yet another embodiment, the isolated proteins of the presentinvention, preferably Lor-2 proteins, can be identified based on thepresence at least one SRCR domain and/or a lysyl oxidase domain and/orand a signal sequence.

[0544] In a preferred embodiment, a Lor-2 family member includes atleast 1, 2, 3, 4, or more scavenger receptor cysteine-rich (“SRCR”)domains. Scavenger receptors are proteins which have been implicated inthe development of atherosclerosis and other macrophage-associatedfunctions. For example, the type I mammalian macrophage scavengerreceptors are membrane glycoproteins implicated in the pathologicdeposition of cholesterol in arterial walls during atherogenesis(Freeman et al. (1990) Proc. Natl. Acad. Sci. U.S.A. 87:8810-8814).Scavenger receptors are characterized by the presence of a cysteine-richdomain, which is proposed to be involved in binding of physiologicalligands (e.g., cell-surface proteins). This cysteine rich domain isreferred to herein and in the art as a scavenger receptor cysteine-rich(“SRCR”) domains. Intra- or intercellular binding of ligand to the SRCRdomain is believed to play a role in signaling or adhesion

[0545] As defined herein, a SRCR domain includes a protein domain whichis about 88-112 amino acid residues in length and has about 16-60%identity with a SRCR of type I human macrophage scavenger receptor(e.g., amino acid residues 353-450 of SEQ ID NO:80). In anotherembodiment, a SRCR is abuse 90-110, 92-108, 94-106, or 95, 96, 97, 98,99, 100, 101, 102, 103, 104, 105, or 106 amino acid residues in lengthand has about 2254%, 26-50%, 28-48%, or 29%, 30%, 31%, 32%, 33%, 34%,35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, or 47%identity with a SRCR of type I human macrophage scavenger receptor(e.g., amino acid residues 353-450 of SEQ ID NO:80). For example, a SRCRdomain can be found in murine type I scavenger receptor (Accession No.1709140) from about amino acid residues 360-457. SRCR domains also havebeen found in diverse secreted and other cell-surface proteins fromhumans (e.g., CD5 and complement factor I), mice (Ly-1), and sea urchins(speract receptor). Moreover, many proteins include more than one SRCRdomain (e.g., Ly-1 includes 3 SRCR domains and the speract receptorincludes 4 SRCR domains). Likewise, human Lor-2 includes 4 SRCR domains,as set forth below.

[0546] To identify the presence of an SRCR in a Lor-2 family member, theamino acid sequence of the protein family member can be searched againsta database of HMMs (e.g., the Pfam database, release 3.3) e.g., usingthe default parameters. For example, the search can be performed usingthe hmmsf program (family specific) and threshold score of 15 fordetermining a hit. hmmsf is available as part of the HMMER package ofsearch programs (HMMER 2.1.1, December 1998) which is freely distributedby the Washington University school of medicine. In one embodiment, ahit to a SRCR HMM having a score of at least 30-40, preferably at least50-60, more preferably at least 70-80, and more preferably at least 90or more is determinative of the presence of a SRCR domain within a queryprotein. A search using the amino acid sequence of SEQ ID NO:72 wasperformed against the HMM database resulting in the identification of 4SRCR domains in the amino acid sequence of SEQ ID NO:72. Accordingly, inone embodiment of the invention, a Lor-2 protein has an SRCR domain atabout amino acids 51-145 of SEQ ID NO:72. (Score of 91.4 against theSRCR domain profile HMM Accession No. PF00530). In another embodiment, aLor-2 protein has an SRCR domain at about amino acids 183-282 of SEQ IDNO:72. (Score of 35.8). In another embodiment, a Lor-2 protein has anSRCR domain at about amino acids 310-407 of SEQ ID NO:72. (Score of128.9). In another embodiment, a Lor-2 protein has an SRCR domain atabout amino acids 420-525 of SEQ ID NO:72. (Score of 55.2).

[0547] Lor-2 family members can further include at least one or moresperact receptor repeated domain (“SRRD”) signatures. The speractreceptor is a transmembrane glycoprotein of 500 amino acid residues(Dangott et al. (1989) PNAS U.S.A. 86:2128-2132) which consists of alarge extracellular domain of 450 which contains four repeats of a ˜115amino acids termed more speract receptor repeated domain or “SRRDs”.Multiple sequence alignment of the four repeats reveals at least 17perfectly conserved residues (including six cysteines, six glycines, andthree glutamates). A SRRD signature has been generated from an alignmentof the four SRRDs and has the consensus sequence:G-x(5)-G-x(2)-E-x(6)-W-G-x(2)-C-x(3)-[FYW]-x(8)-C-x(3)-G, correspondingto SEQ ID NO:74. The SRRD signature is further described in PROSITEDocument, Accession No. PDOC00348 and as PROSITE Accession No. PS00420.In one embodiment, a SRRD signature is included within a SRCR. Forexample, a SRRD can be found in a SRCR of the C-terminal section of themammalian macrophage scavenger receptor type I (Freeman et al. (1990)PNAS U.S.A. 87:8810-8814). Likewise, a SRRD signature can be foundwithin the SRCR domain of human Lor-2 from about amino acids 312-349 ofSEQ ID NO:72.

[0548] The consensus sequences herein are described according tostandard Prosite Signature designation (e.g., all amino acids areindicated according to their universal single letter designation; Xdesignates any amino acid; X(n) designates any n amino acids, e.g., X(2) designates any 2 amino acids; [FYW] indicates any one of the aminoacids appearing within the brackets, e.g., any one of F, Y, or W, in thealternative, any one of Phe, Tyr, or Trp; and {x} indicates any aminobut the amino acid included within the brackets.)

[0549] Lor-2 family members can further include at least one domaincharacteristic of lysyl oxidase, referred to herein as a lysyl oxidasedomain or “LOX domain”. Lysyl oxidase is an extracellularcopper-dependent enzyme that catalyzes the oxidative deamination ofpeptidyl lysine residues in precursors of various collagens andelastins. The deaminated lysines are then able to form aldehydecross-links. (Krebs et al. (1993) Biochem. Biophys. Acta. 1202:7-12).The amino acid sequence of lysyl oxidase includes a signal sequence(e.g., amino acids 1 to 21 of human lysyl oxidase set forth as SEQ IDNO:75, a pro-peptide region (e.g., amino acids 22 to 168 of SEQ IDNO:75), and a region corresponding to the active, processed protein(e.g., amino acids 169-417 of SEQ ID NO:75), which is responsible forthe enzymatic function of the molecule. Lysyl oxidase can be furthercharacterized by the presence of a copper-binding site (Krebs et al.(1993) Biochem. Biophys. Acta. 12-2:7-12) having four conservedhistidine residues that presumably supply the nitrogen ligands forcopper coordination, and a quinone cofactor binding site (Wang et al.(1996) Science 273:1078-1084) (e.g., his289, his292, his294, and his296of SEQ ID NO:75), also referred to as a “copper talon”. The copperbinding site of human Lor-2 can be found, for example, at about aminoacids 286-296 of SEQ ID NO:75.

[0550] Accordingly, as used herein, the term “LOX domain” includes aprotein domain which is about 245-275 amino acid residues in length, andhas about 38-64% identity with the amino acid sequence of processedlysyl oxidase (e.g., amino acid residues 169-417 of SEQ ID NO:75).Preferably, a LOX domain is about 225-300, more preferably about 230-290amino acid residues in length, and more preferably about 235-285, or240-280 amino acid residues in length, and has about 34-65% identity,preferably about 42-62%, and more preferably about 46-56% or 50-52%identity with the amino acid sequence of processed lysyl oxidase (e.g.,amino acid residues 169-417 of SEQ ID NO:75). For example, a LOX domaincan be found in huLOL (SEQ ID NO:76) from about amino acids 310-574; inhuLor (SEQ ID NO:77) from about amino acids 481-751; in mu Lor-2 (SEQ IDNO:78) from about amino acids 464-733; and in huLor-2 (SEQ ID NO:72)from about amino acids 463-732.

[0551] In another embodiment, a LOX domain is involved in a lysyloxidase or lysyl oxidase-like function. Lysyl oxidase or lysyloxidase-like functions include, for example, aminotransferase activity,peptidyl lysine oxidation, oxidative deamination of lysine, crosslinkingof extracellular matrix components, copper binding, and/or coppermetabolism. Lysyl oxidase or lysyl oxidase-like functions are describedin detail, for example, in Kagan et al. in Catalytic Properties andstructural components of lysyl oxidase, John Wiley & Sons (1995) pp.100-121, the contents of which are incorporated herein by reference. Inyet another embodiment, a LOX domain has at least one, preferably two,and more preferably three or four histidine residues corresponding tothe conserved histidine residues of lysyl oxidase which are involved incopper binding. For example, a LOX domain of a human Lor-2 sequence setforth in SEQ ID NO:72 (e.g., amino acid residues 330-732 in SEQ IDNO:72) has four histidine residues (e.g., his604, his607, his609, andhis611 of SEQ ID NO:72) which correspond to those of human lysyl oxidaseset forth as SEQ ID NO:75.

[0552] A LOX domain in a protein can further be included within a lysyloxidase-related region (“LOX-related region”). A LOX-related regionwithin a protein (e.g., within a Lor-2 family member) includes a proteinregion which is about 380-580, preferably about 390-550, more preferablyabout 400, 420, 450 or 500 amino acid residues in length and has atleast 30-35%, 40-45%, 50-55%, 60-65%, 70-75%, 80-85%, or 90-95% homologywith, for example, the amino acid sequence of human LOX. To identify thepresence of a LOX-related region in a Lor-2 family member, the aminoacid sequence of the protein family member can be searched against theHMM database, as described previously. In one embodiment, a hit to a LOXHMM having a score of at least 100-110, preferably at least 120-130,more preferably at least 140-150, and more preferably at least 160 ormore is determinative of the presence of a LOX-related region within aquery protein. A search using the amino acid sequence of SEQ ID NO:72was performed against the HMM database resulting a hit to a LOX HMM fromabout amino acids 330-732 of SEQ ID NO:72. (Score of 166.6 against theLOX domain profile HMM Accession No. PF01186). Similar LOX-relatedregions were identified in muLor-2 from about amino acids 318-733 of SEQID NO:78 (Score of 162.8), in huLOL from about amino acids 1-574 of SEQID NO:76 (Score of 382.2) and in huLor from about amino acids 358-751 ofSEQ ID NO:77 (Score of 146.8). In yet another embodiment, a lysyloxidase-related region has at least 40-45%, 50-55%, 60-65%, 70-75%,80-85%, or 90-95% homology with the amino acid sequence of a LOX domainof a human Lor-2 sequence set forth in SEQ ID NO:72 (e.g., amino acidresidues 330-732 in SEQ ID NO:72). The lysyl oxidase-related regions ofhuLOL, huLor, muLor-2 and huLor-2 are the amino acids corresponding toprocessed lysyl oxidase (e.g., amino acids 169-417 of SEQ ID NO:75).

[0553] Another embodiment of the invention features a protein of theinvention, preferably a Lor-2 protein, which contains a signal sequence.As used herein, a “signal sequence” refers to a peptide containing about25 amino acids which occurs at the N-terminus of secretory proteins andwhich contains a large number of hydrophobic amino acid residues. Forexample, a signal sequence contains at least about 17-33 amino acidresidues, preferably about 20-30 amino acid residues, more preferablyabout 24-26 amino acid residues, and more preferably about 25 amino acidresidues, and has at least about 35-65%, preferably about 38-50%, andmore preferably about 40-45% hydrophobic amino acid residues (e.g.,Valine, Leucine, Isoleucine or Phenylalanine). Such a “signal sequence”,also referred to in the art as a “signal peptide”, serves to direct aprotein containing such a sequence to a lipid bilayer. For example, inone embodiment, a Lor-2 protein contains a signal sequence containingabout amino acids 1-25 of SEQ ID NO:72.

[0554] In yet another embodiment, a protein of the invention, preferablya Lor-2 protein, encodes a mature protein. As used herein, the term“mature protein” refers to a protein of the invention, preferably aLor-2 protein, from which the signal peptide has been cleaved. In anexemplary embodiment, a mature Lor-2 protein contains amino acidresidues 26 to 753 of SEQ ID NO:72.

[0555] In yet another embodiment, Lor-2 family members include at least1, 2, 3, 4, 5 or more N-glycosylation sites. Predicted N-glycosylationsites are found, for example, from about amino acid 111-114, 266-269,390-393, 481-484, and 625-628 of SEQ ID NO:72.

[0556] Lor-2 family members can further include at least 1, 2, 3, 4, 5,6, 7, 8, or more or more Protein kinase C (“PKC”) phosphorylation sites.Predicted PKC phosphorylation sites are found, for example, from aboutamino acid 97-99, 104-106, 221-223, 268-270, 352-354, 510-512, 564-566,and 649-651 of SEQ ID NO:72.

[0557] Lor-2 family members can further include at least 1, 2, 3, 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, or more Casein kinase II phosphorylationsites. Predicted casein kinase II phosphorylation sites are found, forexample, from about amino acid 31-34, 68-71, 115-118, 120-123, 135-138,330-333, 352-355, 377-380, 392-395, 411-414, 424-427, 493-496, 527-530,and 617-620 of SEQ ID NO:72.

[0558] Lor-2 family members can further include at least 1, 2, 3, 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or moreN-myristoylation sites. Predicted N-myristoylation sites are found, forexample, from about amino acids 13-18, 116-121, 130-135, 273-278,312-317, 359-364, 378-383, 403-408, 443-448, 451-456, 463-468, 470-475,489-494, 506-511, 515-520, 521-526, 626-631, 661-666, and 746-751 of SEQID NO:72.

[0559] Lor-2 family members can further include at least one or moreamidation sites. A predicted amidation site is found, for example, fromamino acid 117-180 of SEQ ID NO:72. As used herein, the site(s) have aconsensus sequence selected from: N-{P}-[ST]-{P}(SEQ ID NO:83), where Nis a glycosylation site (see PROSITE document PS00001); [ST]-X-[RK] (SEQID NO:84), where S or T is a phosphorylation site (see PROSITE documentPS00005); [ST]-X (2)-[DE] (SEQ ID NO:85), where S or T is aphosphorylation site (see PROSITE document PS00006); G-{EDRKHPFYW}-X(2)-[STAGCN]-{P}(SEQ ID NO:86), where G is an N-myristoylation site (seePROSITE Accession No. PS00008); and X-G-[RK]-[RK] (SEQ ID NO:87), whereX is an amidation site (see PROSITE document PS00009). These sites arefurther described at the expasy website as PDOC00001, PDOC00005,PDOC00006, PDOC00008, and PS00009, respectively.

[0560] Isolated proteins of the present invention, preferably Lor-2proteins, have an amino acid sequence sufficiently homologous to theamino acid sequence of SEQ ID NO:72 or are encoded by a nucleotidesequence which includes a nucleotide sequence sufficiently homologous toSEQ ID NO:71. As used herein, the term “sufficiently homologous”includes a first amino acid or nucleotide sequence which contains atleast a minimum number of identical or equivalent (e.g., an amino acidresidue which has a similar side chain) amino acid residues ornucleotides to a second amino acid or nucleotide sequence such that thefirst and second amino acid or nucleotide sequences share commonstructural domains or motifs and/or a common functional activity. Forexample, amino acid or nucleotide sequences which share commonstructural domains have at least 30%, 40% or 50% homology, preferably55%, 60%, 65%, 70% or 75% homology, more preferably 80%, 85%, 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homology across the amino acidsequences of the domains and contain at least one and preferably twostructural domains or motifs, are defined herein as sufficientlyhomologous. Furthermore, amino acid or nucleotide sequences which shareat least 30%, 40% or 50% homology, preferably 55%, 60%, 65%, 70% or 75%homology, more preferably 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,97%, 98% or 99% homology and share a common functional activity aredefined herein as sufficiently homologous.

[0561] Accordingly, another embodiment of the invention featuresisolated Lor-2 proteins and polypeptides having a Lor-2 activity.Preferred proteins are Lor-2 proteins having at least a signal sequence,a LOX domain, and at least one SRRD signature. Other preferred proteinsare Lor-2 proteins having at least two, three, or four SRRD signatures.Other preferred proteins are Lor-2 proteins having at least a signalsequence, a LOX domain, and a SRCR domain. Other preferred proteins areLor-2 proteins having at least a signal sequence, a LOX domain, and atleast two SCRC domains. Other preferred proteins are Lor-2 proteinshaving at least a signal sequence, a LOX domain, and at least three SCRCdomains. Other preferred proteins are Lor-2 proteins having at least asignal sequence, a LOX domain, and at least four SCRC domains.

[0562] The nucleotide sequence of the isolated human Lor-2 cDNA and thepredicted amino acid sequence of the human Lor-2 polypeptide are shownin SEQ ID NOs:71 and 72, respectively.

[0563] The human Lor-2 cDNA (set forth in SEQ ID NO:71), which isapproximately 2920 nucleotides in length, encodes a protein having amolecular weight of approximately 83.166 kD (with signal sequence) and80.404 kD (without signal sequence) and which is approximately 753 (withsignal sequence) (SEQ ID NO:72) and 728 amino acid residues (withoutsignal sequence) in length. An ˜3.0 kb Lor-2 message was found to beexpressed most tissues tested but was most highly expressed in heart andplacenta (at least heart, brain, placenta, lung, liver, skeletal muscle,kidney, and pancreas tissues were tested). High expression of Lor-2 wasalso observed in the G361 melanoma cell line and in the SW480adenocarcinoma colon cell line (at least G361, SW480, HL60, Hela 53,K562, Molty, Raji, and A549 cell lines were tested).

[0564] In a preferred embodiment, Lor-2 proteins of the invention havean amino acid sequence of at least 600-900, preferably about 650-850,more preferably about 700-800, and even more preferably about 720-760,728 or 753 amino acid residues in length.

[0565] As used interchangeably herein, a “Lor-2 activity”, “biologicalactivity of Lor-2” or “functional activity of Lor-2”, includes anactivity exerted by a Lor-2 protein, polypeptide or nucleic acidmolecule as determined in vivo, in vitro, or in situ, according tostandard techniques. In one embodiment, a Lor-2 activity is a directactivity, such as an association with a Lor-2-target molecule. As usedherein, a “target molecule” is a molecule with which a Lor-2 proteinbinds or interacts in nature, such that Lor-2-mediated function isachieved. A Lor-2 target molecule can be a Lor-2 protein or polypeptideof the present invention or a non-Lor-2 molecule. For example, a Lor-2target molecule can be a non-Lor-2 protein molecule. Alternatively, aLor-2 activity is an indirect activity, such as an activity mediated byinteraction of the Lor-2 protein with a Lor-2 target molecule such thatthe target molecule modulates a downstream cellular activity (e.g.,interaction of a Lor-2 molecule with a Lor-2 target molecule canmodulate the activity of that target molecule on a cardiac cell).

[0566] In a preferred embodiment, a Lor-2 activity is at least one ormore of the following activities: (i) interaction of a Lor-2 proteinwith a Lor-2 target molecule; (ii) interaction of a Lor-2 protein with aLor-2 target molecule, wherein the Lor-2 target is a ligand; (iii)interaction of a Lor-2 protein with a Lor-2 target molecule, wherein theLor-2 target is an extracellular matrix component (e.g., collagen orelastin); and (iv) modification of a Lor-2 target molecule (e.g.,postranslational modification).

[0567] In yet another preferred embodiment, a Lor-2 activity is at leastone or more of the following activities: (1) crosslinking anextracellular matrix component; (2) regulating bone resorption and/ormetabolism; (3) regulating copper metabolism; (4) modulating maturation,stabilization and/or degradation of extracellular matrix components; (5)regulating cellular signaling; and (6) regulating cellular adhesion(e.g. adhesion of a tumor cell).

[0568] In another embodiment of the invention, a Lor-2 molecule orpreferably, a Lor-2 modulator, is useful for regulating, preventingand/or treating at least one or more of the following diseases ordisorders: (1) diseases or disorders involving impaired coppermetabolism (e.g., type IX of the Ehlers-Danlos syndrome and the Menkessyndrome); (2) bone disorders (e.g., osteoporosis or osteoarthritis);(3) fibrotic disorders (e.g., atherosclerosis, tissue and/or organfibrosis); (4) proliferative disorders (e.g., cancer, for example,prostate cancer, breast cancer, lung cancer and the like); (5) vasculardisorders (e.g., ischemia, ischemic-reperfusion injury); and (6) cardiactrauma (e.g., iatrogenic, accidental).

[0569] In yet another embodiment of the invention, a Lor-2 molecule orpreferably, a Lor-2 modulator, is useful for regulating, preventingand/or treating at least one or more of the following diseases ordisorders: (1) cardiac hypertrophy and cardiomyopathy; (2) cardiacpathologies; (3) myocardial hypertrophy and cardiovascular lesions; (4)myocardial aneurysms; (5) atherosclerotic cardiovascular disease; (6)fibrotic disease; (7) osteoporosis; (8) metastasis/prostate cancer; (9)cellular senescence/tumor suppression; (10) tumor progression; (11)liver fibrosis; (12) wound healing; (13) hypertension; (14) diabetes;(15) arthritis; and (16) bone disease (e.g., osteoporosis orosteoarthritis).

[0570] In yet another embodiment, a Lor-2 modulator is useful forregulating (e.g., inhibiting) tumor progression. For example, Lor-2 maybe secreted by a tumor cell facilitating adhesion (e.g., enhancing theadhesive properties) of the cell. Accordingly, Lor-2 modulators can beused to affect the adhesive properties of tumor cells (e.g., tosurrounding tissues).

[0571] In yet another embodiment, a Lor-2 modulator, is useful forregulating or preventing immunosuppression by tumor cells. For example,Lor-2 may be secreted by a tumor cell, conferring on that cell a growthadvantage (e.g., maintaining the growth, differentiation, andtransformed phenotype of the tumor cell). In such a situation, secretedLor-2 can inhibit cytoxicity (e.g., lymphocytotoxicity, for example,IL-2-induced lymphocytotoxicity). Accordingly, Lor-2 may function tosuppress the generation and/or proliferation of lymphocytic cells (e.g.,lymphocyte-activated killer cells).

[0572] Isolation of the Human 21967 or Lor-2 (i.e., Lysyl OxidaseRelated-2) cDNA

[0573] The invention is based, at least in part, on the discovery of thehuman gene encoding 21967 or Lor-2. Human Lor-2 was isolated from a cDNAlibrary which was prepared from tissue obtained from subjects sufferingfrom congestive heart failure. Briefly, a cardiac tissue sample wasobtained from a biopsy of a 42 year old woman suffering from congestiveheart failure. mRNA was isolated from the cardiac tissue and a cDNAlibrary was prepared therefrom using art-known methods (described in,for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. bySambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press:1989). Using a program which identifies the presence of signal peptides(Nielsen, H. et al. (1997) Protein Engineering 10:1-6) a positive clonewas isolated.

[0574] The sequence of the positive clone was determined and found tocontain an open reading frame. The nucleotide sequence encoding thehuman 21967 or Lor-2 protein comprises about 2920 nucleic acids, and hasthe nucleotide sequence set forth as SEQ ID NO:71. The open readingframe of 21967 is disclosed in SEQ ID NO:73. The protein encoded by thisnucleic acid comprises about 753 amino acids, and has the amino acidsequence set forth as SEQ ID NO:72.

[0575] Analysis of Human 21967 or Lor-2

[0576] A BLAST search (Altschul et al. (1990) J. Mol. Biol. 215:403) ofthe nucleotide and protein sequences of human Lor-2 revealed that Lor-2is similar to the following protein molecules: a human lysyloxidase-related protein (Accession No. U89942) having approximately56.9% identity over amino acids 33-752 of Lor-2 (SEQ ID NO:72); and asecond murine lysyl-oxidase related protein; (Accession No.AF053368)having approximately 92.6% identity over amino acids 1-753, e.g., overthe entire length) of Lor-2 (SEQ ID NO:72). (Identities were calculatedusing the ALIGN algorithm of Huang and Miller (1991) Adv. Appl. Math.12:373-381).

[0577] The Lor-2 protein is predicted to have a signal peptide fromamino acid residues 1-25 of SEQ ID NO:72. Accordingly, a mature Lor-2protein is predicted to include amino acid residues 26-753 of SEQ IDNO:72. Lor-2 is also predicted to have 5 N-glycosylation sites, 8protein kinase phosphorylation (“PKC”) sites, 14 casein kinase IIphosphorylation sites, 19 N-myristoylation sites, and 1 amidation site.Predicted N-glycosylation sites are found, for example, from about aminoacid 111-114, 266-269, 390-393, 481-484, and 625-628 of SEQ ID NO:72.Predicted PKC phosphorylation sites are found, for example, from aboutamino acid 97-99, 104-106, 221-223, 268-270, 352-354, 510-512, 564-566,and 649-651 of SEQ ID NO:72. Predicted casein kinase II phosphorylationsites are found, for example, from about amino acid 31-34, 68-71,115-118, 120-123, 135-138, 330-333, 352-355, 377-380, 392-395, 411-414,424-427, 493-496, 527-530, and 617-620 of SEQ ID NO:72. PredictedN-myristoylation sites are found, for example, from about amino acids13-18, 116-121, 130-135, 273-278, 312-317, 359-364, 378-383, 403-408,443-448, 451-456, 463-468, 470-475, 489-494, 506-511, 515-520, 521-526,626-631, 661-666, and 746-751 of SEQ ID NO:72. A predicted amidationsite is found, for example, from amino acid 117-180 of SEQ ID NO:72.

[0578] Moreover, Lor-2 has a 4 scavenger receptor cysteine-rich domainsfrom amino acid residues 51-145, 183-282, 310-407, and 420-525 of SEQ IDNO:72. The third scavenger receptor cysteine-rich domain includes asperact receptor repeated domain signature from amino acid residues312-349 of SEQ ID NO:72. Lor-2 further has a lysyl oxidase domain fromresidues 330-732 of SEQ ID NO:72. Within the lysyl oxidase domain ofLor-2, there exists a fragment having significant homology to the lysyloxidase putative copper-binding region, termed the “copper-bindingtalon”. A prosite consensus pattern describing the copper-binding talonis as follows: W-E-W-H-S-C-H-Q-H-Y-H (SEQ ID NO:79) (see also PROSITEdocumentation PDOC00716 and Krebs and Krawetz (1993) Biochem. Biophys.Acta 1202:7-12). Amino acid residues 601-701 of human Lor-2 (SEQ IDNO:72) have ˜73% identity with this consensus sequence (8/11 residues)including each of the four conserved histidines, three of which arebelieved to be copper ligands residing within an octahedral coordinationcomplex of lysyl oxidase.

[0579] Analysis of primary and secondary protein structures of 21967 wasperformed as follows: alpha, beta turn and coil regions, Garnier-Robsonalgorithm (Garnier et al. (1978) J Mol Biol 120:97); alpha, beta, andturn regions, Chou-Fasman algorithm (Chou and Fasman (1978) Adv inEnzymol Mol 47:45-148); hydrophilicity and hydrophobicity plots,Kyte-Doolittle algorithm (Kyte and Doolittle (1982) J Mol Biol157:105-132); alpha amphipathic and beta amphipathic regions, Eisenbergalgorithm (Eisenberg et al. (1982) Nature 299:371-374); flexibleregions, Karplus-Schulz algorithm (Karplus and Schulz (1985)Naturwissens-Chafen 72:212-213); antigenic index, Jameson-Wolf algorithm(Jameson and Wolf (1988) CABIOS 4:121-136); surface probability plot,Emini algorithm (Emini et al. (1985) J Virol 55:836-839).

[0580] Prediction of the Chromosomal Location of 21967 orLor-2—Electronic Mapping

[0581] To predict the chromosomal location of Lor-2, the Lor-2nucleotide sequence of SEQ ID NO:71 was used to query, using the BLASTNprogram (Altschul S. F. et al, (1990) J. Mol. Biol. 215: 403-410) with aword length of 12 and using the BLOSUM62 scoring matrix, a database ofhuman nucleotide sequences originating from nucleotide molecules (e.g.,EST sequences, STS sequences and the like) that have been mapped to thehuman genome. Nucleotide sequences which had been previously mapped tohuman chromosome 2 near the D2S145 marker (e.g., having Accession Nos.AA191602 and R55706) were found to have high sequence identity toportions of the Lor-2 nucleotide sequence (3′ UTR sequence) indicatingthat Lor-2 maps to the same chromosomal location. Moreover, it ispredicted that allelic variants of Lor-2 will map the same chromosomallocation and species orthologs of Lor-2 will map to loci syntenic withthe human Lor-2 locus.

[0582] Confirmation and Analysis of the Chromosomal Location of 21967 orLor-2—PCR Mapping

[0583] The hLor-2 gene was mapped to human chromosome 2 (i.e., 2 μl1-p13), which is syntenic to mouse chromosome 6, by PCR typing of theGenebridge (G4) radiation hybrid panel (Research Genetics, Inc.,Huntsville, Ala.). Typing of the DNA and comparison to radiation hybridmap data at the Whitehead Institute Center for Genome Research (WICGR)tightly linked the hLor-2 gene to a region on human chromosome 2 betweenWI-5987 (13.9cR) and GCT1B4 (16.7cR).

[0584] The huLor-2 primers used in the PCR mapping studies were:forward—GCTTACCAAGAAACCCATGTCAGC (SEQ ID NO:81) andreverse—GGCAGTTAGTCAGGTGCTGC (SEQ ID NO:82). The radiation hybridmapping studies were performed as follows: PCR reactions of radiationhybrid panels, GeneBridge 4 (Research Genetics, Inc., Huntsville, Ala.)were assembled in duplicate using an automated PCR assembly program on aTECAN Genesis. Each reaction consisted of: 5 μl DNA template (10 ng/μl),1.5 μl 10×PCR buffer, 1.211 dNTPs (2.5 mM), 1.15 μl forward primer (6.6μM) 1.15 μl reverse primer (6.6 μM0, and 5 μl 1:75 platinum Taq. Thereactions were thermocycled on a Perkin-Elmer 9600 for 95° C. 10 minutes(for the platinum Taq), [95° C. 40 sec, 52° C. 40 sec, 72° C., 50 sec]35×, 72° C., 5 minutes, 4° C. hold. Resulting PCR products were run outon a 2% agarose gel and visualized on a UV light box.

[0585] The positive hybrids for the Genebridge 4 panel were submitted tothe Whitehead Genome Center for placement in relation to a frameworkmap.

[0586] Human Lor-2 mapped in close proximity to known genes includingactin, gamma 2, smooth muscle, enteric (“ACTG2”), nucleolysin TIA1,semaphorin W (“SEMAW”), dysferlin (“DYSF”), docking protein 1 (“DOK1),glutamine-fructose-6-phosphate transaminase 1 (“GFPT”), the KIAA0331gene, deoxyguanosine kinase (“DGUOK”), the TSC501 gene, eukaryotictranslation initiation factor 3, subunit 10 (“EIF3S1”), tachykininreceptor 1 (“TACR1”), tissue-type plasminogen activator (“PLAT”) anddual specificity phosphatase 11 (“DUSP11”). Nearby disease mutationsand/or loci include Alstrom syndrome (“ALMS1”), an autosomal recessivelyinherited syndrome characterized by retinal degeneration, obesity,diabetes mellitus, neurogenous deafness, hepatic dysfunction, and insome cases, late onset cardiomyopathy (see e.g., Alstrom et al. (1959)Acta Psychiat. Neurol. Scand. 34 (suppl. 129):1-35; Alter and Moshang(1993) Am. J. Dis. Child. 147:97-99; Awazu et al. (1997) Am. J. Med.Genet. 69:13-16; Aynaci et al. (1995) (Letter) Clin. Genet. 48:164-166;Charles et al. (1990) J. Med. Genet. 27:590-592; Cohen and Kisch (1994)Israel J. Med. Sci. 30:234-236; Collin et al. (1997) Hum. Molec. Genet.6:213-219; Collin et al. (1999) (Letter) Clin. Genet. 55:61-62; Connollyet al. (1991) Am. J. Med. Genet. 40:421-424; Goldstein and Fialkow(1973) Medicine 52:53-71; Macari et al. (1998) Hum. Genet. 103:658-661;Marshall et al. (1997) Am. J. Med. Genet. 73:150-161; Michaud et al.(1996) J. Pediat. 128:225-229; Millay et al. (1986) Am. J. Ophthal.102:482-490; Rudiger et al. (1985) Hum. Genet. 69:76-78; Russell-Eggittet al. (1998) Ophthalmology 105: 1274-1280; Tremblay et al. (1993) Am.J. Ophthal. 115:657-665; Warren et al. (1987) Am. Heart J. 114:1522-1524and Weinstein et al. (1969) New Eng. J. Med. 281:969-977), orofacialcleft 2 (“OFC2”) (see e.g., Carinci et al. (1995) (Letter) Am. J. Hum.Genet. 56:337-339; Pezzetti et al. (1998) Genomics 50:299-305 andScapoli et al. (1997) Genomics 43:216-220) and Parkinsons disease 3 (seee.g., Di Rocco et al. (1996) Adv. Neurol. 69:3-11 and Gasser et al.(1998) Nature Genet. 18:262-265). Additional information regardingAlstrom syndrome, orofacial cleft 2 and Parkinson disease 3 can be foundcollected under Accession Nos. 203800, 602966 and 602404, respectively,in the Online Mendelian Inheritance in Man (“OMIM™”) database, thecontents of which are incorporated herein by reference.

[0587] Moreover, the syntenic location on mouse chromosome 6 is nearovarian teratoma susceptability 1 (“Ots-1”), dysruption ofcorticosterone in adrenal cortex cells (“Cor”), brain protein 1(“Brp1”), lymphocyte antigen 36 (“Ly36”), major liver protein 1(“Lvp1”), cerebellar deficient folia (“cdf”), motor neuron degeneration2 (“mnd2”), truncate (“tc”) and faded (“fe”). Of particular interest arethe Lor-2 neighbors Ots-1 and Cor, both of which a postulated to play arole in tumor susceptibility. The Ots-1 locus was identified by linkageanalysis of female LT/Sv mice, a strain characterized by its abnormallyhigh incidence of spontaneous ovarian teratomas, which are extremelyrare for other mouse strains. Ots-1 was identified as the single majorlocus that increases the frequency of teratomas in a semidominant manner(Lee et al. (1997) Cancer Res. 57:590-593. Likewise, the cor locus wasidentified as being associated with a phenotype of the AJ mouse strain(a strain susceptible to many neoplasms and infectious agents,presumably due to a deficiency in the phophylactic activities ofendogenous glucocorticoids (e.g., adrenalcortical corticosterone (“CS”))(Thaete et al. (1990) Proc. Soc. Exp. Biol. Med. 194:97-102).Accordingly, at least two loci in the near vicinity of mouse Lor-2 onchromosome 6 are associated with tumor susceptibility. Additionalinformation regarding the Ots-1 and Cor loci can be found collectedunder Accession Nos. MGI:85864 and MGI:58993, respectively, in the MouseGenomics Informatics database, the contents of which are incorporatedherein by reference. Likewise, information regarding the cdf locus, themnd2 locus and the mouse Lor-2 gene (i.e., the mouse ortholog of humanLor-2) can be found collected under Accession Nos. MGI:86274, MGI:97039and MGI:1337004, respectively.

[0588] Tissue Distribution of 21967 or Lor-2 mRNA

[0589] Standard molecular biology methods (Sambrook, J., Fritsh, E. F.,and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed., ColdSpring Harbor Laboratory, Cold Spring Harbor Laboratory Press, ColdSpring Harbor, N.Y., 1989) were used to construct cDNA libraries inplasmid vectors from multiple human tissues. Individual cDNA clones fromeach library were isolated and sequenced and their nucleotide sequenceswere input into a database. The Lor-2 nucleotide sequence of SEQ IDNO:71 was used to query the tissue-specific library cDNA clonenucleotide sequence database using the BLASTN program (Altschul S. F. etal, (1990) J. Mol. Biol. 215: 403-410) with a word length of 12 andusing the BLOSUM62 scoring matrix. Nucleotide sequences identical toportions of the Lor-2 nucleotide sequence of SEQ ID NO:71 were found incDNA libraries originating from human endothelial cells, lymph node,bone, heart, neuron, and testes. Lor-2 nucleic acid sequences, fragmentsthereof, proteins encoded by these sequences, and fragments thereof aswell as modulators of Lor-2 gene or protein activity may be useful fordiagnosing or treating diseases that involve the tissues in which theLor-2 mRNA is expressed. Likewise, when a similar analysis was performedusing the Lor-2 sequence of SEQ ID NO:71 to query publicly availablenucleotide sequence databases (e.g., DBEST databases) using BLAST,sequences having high homology to the 3′ untranslated region of humanLor-2 were identified in a Soares placenta normalized library and inSoares testis, B-cell and lung normalized libraries.

[0590] Northern blot hybridization with RNA samples was next performedunder standard conditions and washed under stringent conditions, i.e.,0.2×SSC at 65° C. A DNA probe was radioactively labeled with ³²P-dCTPusing the Prime-It kit (Stratagene, La Jolla, Calif.) according to theinstructions of the supplier. Filters containing various tissue and cellline mRNAs were probed in ExpressHyb hybridization solution (Clontech)and washed at high stringency according to manufacturer'srecommendations.

[0591] On a human mRNA blot containing mRNA from heart, brain, placenta,lung, liver, skeletal muscle, kidney, and pancreas, Lor-2 transcript(˜3.0 kb) was detected in all tissues tested but was most stronglydetected in heart and placenta. Moreover, Lor-2 mRNA was stronglyexpressed in the G361 melanoma cell line and in the SW480 adenocarcinomacolon cell lines (as compared to expression in the HL60, HeLa53, K562,Molty, Raji, and SW480 cell lines (SW480 cell line expressing a 2.4 kbtranscript). Transcripts of 5 kb and 2 kb were also detected evidencingpossible splice variants of Lor-2.

[0592] Testing of a larger panel of human tissues revealed the followingexpression levels. Expression levels were normalized to beta 2expression. TABLE 45 hu Lor-2 Expression in Normal Tissues huLor-2 Beta2 Relative Tissue Source Expression Expression Expression* Lymph Node(MPI 79) 30.550 18.170 10.78 Lymph Node (NDR 173) 29.930 19.190 33.59Heart (PIT 272) 26.145 18.170 57.06 Heart (PIT 273) 29.375 19.110 46.85Lung (MPI 131) 29.650 19.480 50.04 Lung (NDR 185) 27.165 17.050 51.96Kidney (MPI 58) 30.695 20.790 60.13 Spleen (MPI 360) 27.005 17.150 62.25SK Muscle (MPI 38) 29.480 20.400 106.15 Fetal Liver (MPI 425) 30.06520.520 75.85 Fetal Liver (MPI 133) 31.570 23.550 221.32 Tonsil (MPI 37)29.480 17.890 18.64 Colon (MPI 383) 30.045 19.830 48.50 Brain (MPI 422)30.525 22.220 181.65 Liver (MPI 75) 32.935 20.940 14.07 Liver (MPI 365)31.060 18.770 11.35 Liver (MPI 339) 33.985 20.740 5.92 Liver (MPI 154)32.000 19.970 13.74 Liver (NDR 206) 33.750 20.370 5.41 Liver (PIT 260)32.705 18.970 4.23 CD14 26.945 17.190 66.49 Granulocytes 30.825 19.24018.77 NHLH (resting) 36.595 19.920 1.10 NHLH (activated) 35.570 19.7601.00 Liver Fibrosis (MPI 447) 29.320 18.300 27.67 Liver Fibrosis (NDR190) 36.495 24.180 22.55 Liver Fibrosis (NDR 191) 30.105 19.770 44.63Liver Fibrosis (NDR 192) 33.415 22.410 27.95 Liver Fibrosis (NDR 193)30.795 19.830 28.74 Liver Fibrosis (NDR 204) 33.360 21.580 16.34 LiverFibrosis (NDR 126) 31.900 21.180 34.18 Liver Fibrosis (NDR 113) 29.17518.510 36.51 Liver Fibrosis (NDR 79) 30.870 20.390 40.22 Liver Fibrosis(NDR 112) 31.955 21.770 49.52 Liver Fibrosis (NDR 225) 30.645 20.35045.89 Liver Fibrosis (NDR 141) 33.045 22.250 32.45

[0593] Next, Lor-2 expression levels were measured in a variety oftissue and cell samples using the TaqMan™ procedure. TABLE 46 hu Lor-23′ UTR Expression in Normal Human Tissues Relative Relative TissueSource Expression* Tissue Source Expression* Prostate 2.5 Aorta 11.8Prostate 10.9 Testis 16.4 Liver 2.4 Testis 21.7 Liver 2.5 Thyroid 4.4Breast 26.7 Thyroid 7.2 Breast 59.3 Placenta 73.3 Skeletal Muscle 13.4Placenta 61.8 Skeletal Muscle 5.5 Fetal Kidney 87.7 Brain 12.6 FetalLiver 10.0 Brain 12.7 Fetal Liver 64.7 Colon 7.2 Fetal Heart 14.4 Colon3.4 Fetal Heart 70.8 Heart 1.8 Osteoblasts 207.9 (undif.) Heart 1.8Osteoblasts 128.0 (dif.) Ovary 1.8 Small Intestine 7.9 Ovary 1.4 Cervix86.5 Kidney 1.0 Spleen 6.3 Kidney 2.3 Esophagus 2.4 Lung 1.8 Thymus 1.4Lung 4.2 Tonsil 1.7 Vein 57.5 Lymphnode 3.1 Vein 16.1

[0594] The highest expression was observed in osteoblasts, cervix,kidney and placenta on the normal human tissue panel tested.

[0595] Expression of 21967 or Lor-2 mRNA in Clinical Tumor Samples andin Xenograft Cell Lines

[0596] In this example, RT-PCR was used to detect the presence of Lor-2mRNA in various tumor and metastatic tissue samples as compared tonormal tissue samples. RT-PCR was also used to detect the presence ofLor-2 mRNA in various xenograft cell lines. In breast tissue, Lor-2 mRNAwas detected in 0/1 normal tissue samples as compared to 3/4 tumorclinical samples after 30 cycles of PCR. In xenograft cell linesisolated from breast tissue, Lor-2 mRNA was detected in 1/1 normal and3/3 xenograft cell lines (cell lines MCF7, ZR75 and T47D). In lungtissue, Lor-2 mRNA was detected in 0/2 normal tissue samples as comparedto 2/8 tumor tissue samples. In xenograft cell lines isolated from lungtissue, Lor-2 mRNA was detected in 0/5 xenograft cell lines after 30cycles of PCR. In a second experiment performed with lung tissue, Lor-2mRNA was detected in 2/2 normal and 8/8 tumor tissue samples, as well asin 5/5 xenograft cell lines (cell lines A549, H69, H125, H322 and H460)after 35 cycles of PCR. In colon tissue, Lor-2 mRNA was detected in 2/2normal, 5/5 tumor and 5/5 metastatic samples, as well as in 7/7xenograft cell lines (cell lines HCT116, HCT15, HT29, SW620, SW480, DLD1and KM12) after 35 cycles of PCR. In liver tissue, LOR-2 mRNA wasdetected in 2/2 normal samples after 35 cycles of PCR. These data revealthat there exists a correlation between tumors and Lor-2 expression, atleast in breast and lung tissues.

[0597] To further investigate this finding, Lor-2 mRNA levels weremeasured by quantitative PCR using the TaqMan™ procedure as describedabove. The procedure was carried out on cDNA generated from variouscarcinoma samples and compared to normal counterpart tissue samples. In5/7 breast carcinomas, a 2-86 fold upregulation of Lor-2 was observed ascompared to 2/4 normal breast tissue samples. Likewise, in 4/7 lungcarcinomas, a 2-17 fold upregulation was observed as compared to 3/4normal lung tissue samples. The relative levels of Lor-2 mRNA detectedin various normal, tumor and metastases samples are set forth in Table47. TABLE 47 hu Lor-2 Expression - TaqMan Analysis of Oncology PanelTissue Relative Tissue Relative Source Expression Source ExpressionBreast N 46.85 Colon N 48.50 Breast N 18.96 Colon N 4.94 Breast N 1.00Colon N 10.09 Breast N 11.75 Colon N 4.94 Breast T 86.52 Colon T 10.78Breast T 37.27 Colon T 10.89 Breast T 25.72 Colon T 17.39 Breast T 60.76Colon T 10.82 Breast T 19.84 Colon T 9.09 Breast T 22.24 Colon T 26.63Breast T 16.26 Liver Met 10.93 Lung N 9.32 Liver Met 10.30 Lung N 3.34Liver Met 12.25 Lung N 1.65 Liver Met 12.91 Lung N 3.84 Liver N 4.30Lung T 4.26 Liver N 3.69 Lung T 7.39 Liver N 3.48 Lung T 9.13 Liver N5.41 Lung T 12.08 Lung T 6.48 Lung T 17.27 Lung T 28.15

[0598] These data reveal a significant upregulation of Lor-2 mRNA in atleast breast and lung carcinomas. Moreover, there was a significantupregulation of Lor-2 expression in metastatic as compared to normalliver samples. Given that the mRNA for Lor-2 is expressed in a varietyof tumors, with significant upregulation in carcinoma samples incomparison to normal samples, it is believed that inhibition of Lor-2activity may inhibit tumor progression by affecting the adhesiveproperties of the tumor cells to surrounding tissues.

[0599] Human 1983 (SLGP)

[0600] The present invention is based, at least in part, on thediscovery of novel G-protein coupled receptor (GPCR) family members,referred to herein as SLGP protein and nucleic acid molecules. The humanSLGP molecules are also referred to as “1983” molecules and the mouseSLGP molecules are also referred to as “12231 or “m1983” molecules. Thepresent invention also provides methods and compositions for thediagnosis and treatment of cellular proliferation, growth,differentiation, or migration disorders (e.g., cancer, arthritis,retinal and optic disk neovascularization, and tissue ischemia, such asmyocardial ischemia).

[0601] The present invention is also based, at least in part, on thediscovery that the novel SLGP molecules of the present invention areupregulated in in vitro proliferating and tube forming Human DermalMicrovascular Endothelial Cells (HMVEC) (see details below), areexpressed in endothelial cells of glioblastomas as compared to normalbrains (see details below), and are upregulated in VEGF-inducedangiogenic xenograft plugs as compared to parental xenografts (seedetails below). Therefore, the SLGP molecules of the present inventionmodulate angiogenesis by endothelial cells (e.g., tumor endothelialcells). Accordingly, the SLGP molecules of the present invention areuseful as targets for developing modulating agents to regulate a varietyof cellular processes including angiogenesis (e.g., the proliferation,elongation, and migration of endothelial cells, such as endothelialcells in tumors). Angiogenesis is responsible for the formation of newvessels in tumor sites. The new vessels provide the oxygen andnutritional supply to tumors. Therefore, the SLGP modulators of theinvention can modulate tumor formation and growth by modulatingangiogenesis. For example, inhibition of the activity of an SLGPmolecule can cause decreased angiogenesis, i.e., a decrease in cellularproliferation, elongation, and migration of endothelial cells and, thus,a decrease in the formation of new vessels, and a decrease in the supplyof oxygen and nutrition to a tumor. Therefore, the SLGP modulators ofthe invention can be used to treat formation and growth of tumors, e.g.,cancer, and other diseases characterized by excessive vessel formationsuch as arthritis and retinopathy. Additionally, increasing the activityof an SLGP molecule can cause increased angiogenesis and, therefore,increased vessel formation and can, thus, be used in treating diseasescharacterized by decreased vessel formation, e.g., tissue ischemia.Therefore, the SLGP molecules of the present invention are useful astargets and therapeutic agents for the modulation of diseasescharacterized by decreased angiogenesis, e.g., tissue ischemia, such asmyocardial ischemia.

[0602] The SLGP protein is a GPCR that participates in signalingpathways within cells, e.g., signaling pathways involved inproliferation or differentiation. As used herein, a signaling pathwayrefers to the modulation (e.g., the stimulation or inhibition) of acellular function/activity upon the binding of a ligand to the GPCR(SLGP protein). Examples of such functions include mobilization ofintracellular molecules that participate in a signal transductionpathway, e.g., phosphatidylinositol 4,5-bisphosphate (PIP₂), inositol1,4,5-triphosphate (IP₃) or adenylate cyclase; polarization of theplasma membrane; production or secretion of molecules; alteration in thestructure of a cellular component; cell proliferation, e.g., synthesisof DNA and angiogenesis, e.g., proliferation, elongation, and migrationof endothelial cells (e.g., tumor endothelial cells) to form new vessels(e.g., endothelial tubes); cell differentiation; and cell survival.

[0603] Regardless of the cellular activity modulated by SLGP, it isuniversal that as a GPCR, the SLGP protein interacts with a “G protein”to produce one or more secondary signals in a variety of intracellularsignal transduction pathways, e.g., through phosphatidylinositol orcyclic AMP metabolism and turnover, in a cell. G proteins represent afamily of heterotrimeric proteins composed of α, β and γ subunits, whichbind guanine nucleotides. These proteins are usually linked to cellsurface receptors, e.g., receptors containing seven transmembranedomains, such as the ligand receptors. Following ligand binding to thereceptor, a conformational change is transmitted to the G protein, whichcauses the α-subunit to exchange a bound GDP molecule for a GTP moleculeand to dissociate from the βγ-subunits. The GTP-bound form of theα-subunit typically functions as an effector-modulating moiety, leadingto the production of second messengers, such as cyclic AMP (e.g., byactivation of adenylate cyclase), diacylglycerol or inositol phosphates.Greater than 20 different types of α-subunits are known in man, whichassociate with a smaller pool of β and γ subunits. Examples of mammalianG proteins include Gi, Go, Gq, Gs and Gt. G proteins are describedextensively in Lodish H. et al. Molecular Cell Biology, (ScientificAmerican Books Inc., New York, N.Y., 1995), the contents of which areincorporated herein by reference.

[0604] As used herein, the phrase “phosphatidylinositol turnover andmetabolism” includes the molecules involved in the turnover andmetabolism of phosphatidylinositol 4,5-bisphosphate (PIP₂) as well as tothe activities of these molecules. PIP₂ is a phospholipid found in thecytosolic leaflet of the plasma membrane. Binding of a ligand to theSLGP activates, in some cells, the plasma-membrane enzyme phospholipaseC that in turn can hydrolyze PIP₂ to produce 1,2-diacylglycerol (DAG)and inositol 1,4,5-tri phosphate (IP₃). Once formed IP₃ can diffuse tothe endoplasmic reticulum surface where it can bind an IP3 receptor,e.g., a calcium channel protein containing an IP3 binding site. IP₃binding can induce opening of the channel, allowing calcium ions to bereleased into the cytoplasm. IP₃ can also be phosphorylated by aspecific kinase to form inositol 1,3,4,5-tetraphosphate (IP₄), amolecule which can cause calcium entry into the cytoplasm from theextracellular medium. IP₃ and IP₄ can subsequently be hydrolyzed veryrapidly to the inactive products inositol 1,4-biphosphate (IP₂) andinositol 1,3,4-triphosphate, respectively. These inactive products canbe recycled by the cell to synthesize IP₂. The other second messengerproduced by the hydrolysis of IP₂ namely 1,2-diacylglycerol (DAG),remains in the cell membrane where it can serve to activate the enzymeprotein kinase C. Protein kinase C is usually found soluble in thecytoplasm of the cell, but upon an increase in the intracellular calciumconcentration, this enzyme can move to the plasma membrane where it canbe activated by DAG. The activation of protein kinase C in differentcells results in various cellular responses such as the phosphorylationof glycogen synthase, or the phosphorylation of various transcriptionfactors, e.g., NF-kB. The language “phosphatidylinositol activity”, asused herein, includes an activity of PIP₂ or one of its metabolites.

[0605] Another signaling pathway in which the SLGP protein mayparticipate is the cAMP turnover pathway. As used herein, “cyclic AMPturnover and metabolism” includes molecules involved in the turnover andmetabolism of cyclic AMP (cAMP) as well as to the activities of thesemolecules. Cyclic AMP is a second messenger produced in response toligand induced stimulation of certain G protein coupled receptors. Inthe ligand signaling pathway, binding of ligand to a ligand receptor canlead to the activation of the enzyme adenylate cyclase, which catalyzesthe synthesis of cAMP. The newly synthesized cAMP can in turn activate acAMP-dependent protein kinase.

[0606] The SLGP molecules of the present invention are involved inmodulation of cellular proliferation, growth, differentiation, ormigration processes. As used herein, a “cellular proliferation, growth,differentiation, or migration process” includes a process by which acell e.g., an endothelial cell, increases in number, size, or content;by which a cell develops a specialized set of characteristics whichdiffer from that of other cells; or by which a cell moves closer to orfurther from a particular location or stimulus (e.g., angiogenesis). Asused herein, “cellular proliferation, growth, differentiation, ormigration disorders” include cancer, e.g., carcinoma, sarcoma, orleukemia; tumor angiogenesis and metastasis; and other diseases whichare characterized by increased or deceased angiogenesis, including, butnot limited to arthritis, retinal and optic disk neovascularization, andtissue ischemia, such as myocardial ischemia.

[0607] The activity of the SLGP proteins of the invention may also beimplicated in cardiovascular disorders, congestive heart failure, orother cardiac cellular processes. As used herein, the term“cardiovascular disorder” includes a disease, disorder, or stateinvolving the cardiovascular system, e.g., the heart, the blood vessels,and/or the blood. A cardiovascular disorder can be caused by animbalance in arterial pressure, a malfunction of the heart, or anocclusion of a blood vessel, e.g., by a thrombus. Examples of suchdisorders include hypertension, atherosclerosis, coronary artery spasm,coronary artery disease, valvular disease, arrhythmias, cardiomyopathies(e.g., dilated cardiomyopathy, idiopathic cardiomyopathy),arteriosclerosis, ischemia reperfusion injury, restenosis, arterialinflammation, vascular wall remodeling, ventricular remodeling, rapidventricular pacing, coronary microembolism, tachycardia, bradycardia,pressure overload, aortic bending, coronary artery ligation, vascularheart disease, atrial fibrilation, long-QT syndrome, congestive heartfailure, sinus node disfunction, angina, heart failure, hypertension,atrial fibrillation, atrial flutter, myocardial infarction, cardiachypertrophy, and coronary artery spasm.

[0608] As used herein, the term “congestive heart failure” includes acondition characterized by a diminished capacity of the heart to supplythe oxygen demands of the body. Symptoms and signs of congestive heartfailure include diminished blood flow to the various tissues of thebody, accumulation of excess blood in the various organs, e.g., when theheart is unable to pump out the blood returned to it by the great veins,exertional dyspnea, fatigue, and/or peripheral edema, e.g., peripheraledema resulting from left ventricular dysfunction. Congestive heartfailure may be acute or chronic. The manifestation of congestive heartfailure usually occurs secondary to a variety of cardiac or systemicdisorders that share a temporal or permanent loss of cardiac function.Examples of such disorders include hypertension, coronary arterydisease, valvular disease, and cardiomyopathies, e.g., hypertrophic,dilative, or restrictive cardiomyopathies. Congestive heart failure isdescribed in, for example, Cohn J. N. et al. (1998) American FamilyPhysician 57:1901-04, the contents of which are incorporated herein byreference.

[0609] As used herein, the term “cardiac cellular processes” includesintra-cellular or inter-cellular processes involved in the functioningof the heart. Cellular processes involved in the nutrition andmaintenance of the heart, the development of the heart, or the abilityof the heart to pump blood to the rest of the body are intended to becovered by this term. Such processes include, for example, cardiacmuscle contraction, distribution and transmission of electricalimpulses, and cellular processes involved in the opening and closing ofthe cardiac valves. The term “cardiac cellular processes” furtherincludes processes such as the transcription, translation andpost-translational modification of proteins involved in the functioningof the heart, e.g., myofilament specific proteins, such as troponin I,troponin T, myosin light chain 1 (MLC1), and α-actinin.

[0610] The novel SLGP molecules of the present invention comprise afamily of molecules having certain conserved structural and functionalfeatures. The term “family” when referring to the protein and nucleicacid molecules of the invention is intended to mean two or more proteinsor nucleic acid molecules having a common structural domain or motif andhaving sufficient amino acid or nucleotide sequence homology as definedherein. Such family members can be naturally or non-naturally occurringand can be from either the same or different species. For example, afamily can contain a first protein of human origin, as well as other,distinct proteins of human origin or alternatively, can containhomologues of non-human origin. Members of a family may also have commonfunctional characteristics.

[0611] For example, the family of G protein-coupled receptors (GPCRs),to which the SLGP proteins of the present invention bear significanthomology, comprise an N-terminal domain, seven transmembrane domains(also referred to as membrane-spanning domains), six loop domains, and aC-terminal cytoplasmic domain (also referred to as a cytoplasmic tail).Members of the SLGP family also share certain conserved amino acidresidues, some of which have been determined to be critical to receptorfunction and/or G protein signaling. For example, GPCRs usually containthe following features: a conserved asparagine residue in the firsttransmembrane domain; a cysteine residue in the second loop which isbelieved to form a disulfide bond with a conserved cysteine residue inthe fourth loop; a conserved leucine and aspartate residue in the secondtransmembrane domain; an aspartate-arginine-tyrosine motif (DRY motif)at the interface of the third transmembrane domain and the third loop ofwhich the arginine residue is almost invariant (members of the rhodopsinsubfamily of GPCRs comprise a histidine-arginine-methionine motif (HRMmotif) as compared to a DRY motif); a conserved tryptophan and prolineresidue in the fourth transmembrane domain; and conserved phenylalanineand leucine residues in the seventh transmembrane domain. Table 48depicts an alignment of the transmembrane domain of 5 GPCRs. Theconserved residues described herein are indicated by asterices. TABLE 48Alignment of Transmembrane Domains thrombin  (6.) human P25116 rhodopsin(19.) human P08100 m1ACh (21.) rat P08482 IL-8A (30.) human P25024octopamine (40.) Drosophila melanogaster P22270 TM1                   *6. 102 TLFVPSVYTGVFVVSLPLNIMAIVVFILKMK 132 19. 37FSMLAAYMFLLIVLGFPINFLTLYVTVQHKK 67 21. 25VAFIGITTGLLSLATVTGNLLVLISFKVNTE 55 30. 39KYVVIIAYALVFLLSLLGNSLVMLVILYSRV 69 40. 109ALLTALVLSVIIVLTIIGNILVILSVFTYKP 139                   |1111111111111111111111111111111 33333333444444444455555555556662345678901234567890123456789012 TM2       *   * 6. 138VVYMLHLATADVLFVSVLPFKISYYFSG 165 19. 73 NYILLNLAVADLFMVLGGFTSTLYTSLH 10021. 61 NYFLLSLACADLIIGTFSMNLYTTYLLM 88 30. 75DVYLLNLALADLLFALTLPIWAASKVNG 102 40. 145 NFFIVSLAVADLTVALLVLPFNVAYSIL172           | 22222222222222222222222222224444444444555555555566666666 0123456789012345678901234567 TM3                        * 6. 176 RFVTAAFYCNMYASILLMTVISIDR 200 19. 111NLEGFFATLGGEIALWSLVVLAIER 135 21. 99 DLWLALDYVASNASVMNLLLISFDR 123 30.111 KVVSLLKEVNFYSGILLLACISVDR 135 40. 183 KLWLTCDVLCCTSSILNLCAIALDR 207                        | 33333333333333333333333332222333333333344444444445 6789012345678901234567890 TM4            *        * 6. 215 TLGRASFTCLAIWALAIAGVVPLVLKE 241 19. 149GENHAIMGVAFTWVMALACAAPPLAGW 175 21. 138 TPRRAALMIGLAWLVSFVLWAPAILFW 16430. 149 KRHLVKFVCLGCWGLSMNLSLPFFLFR 175 40. 222TVGRVLLLISGVWLLSLLISSPPLIGW 248             |444444444444444444444444444 334444444444555555555566666890123456789012345678901234 TM5            *  *       * 6. 268AYYFSAFSAVFFFVPLIISTVCYVSIIRC 296 19. 201 ESFVIYMFVVHFTIPMIIIFFCYGQLVFT229 21. 186 PIITFGTAMAAFYLPVTVMCTLYWRIYRE 214 30. 200MVLRILPHTFGFIVPLFVMLFCYGFTLRT 228 40. 267 RGYVIYSSLGSFFIPLAIMTIVYIEIFVA295               | 5555555555555555555555555555533334444444444555555555566666 67890123456789012345678901234 TM6         *  *  * 6. 313 FLSAAVFCIFIICFGPTNVLLIAHYSFL 340 19. 252RMVIIMVTAFLICWVPYASVAFYIFTHQ 279 21. 365 RTLSAILLAFILTWTPYNIMVLVSTFCK397 30. 242 RVIFAVVLIFLLCWLPYNLVLLADTLMR 269 40. 529RTLGIIMGVFVICWLPFFLMYVILPFCQ 556                |6666666666666666666666666666   33333444444444455555555566665678901234567890123456789012 TM7                     **  * 6. 347EAAYFAYLLCVCVSSISSCIDPLIYYYASSECQ 379 19. 282NFGPIFMTIPAFFAKSAAIYNPVIYIMMNKQFR 314 21. 394CVPETLWELGYWLCYVNSTVNPMCYALCNKAFR 426 30. 281NNIGRALDATEILGFLHSCLNPIIYAFIGQNFR 313 40. 559CPTNKFKNFITWLGYINSGLNPVIYTIFNLDYR 591                      |777777777777777777777777777777777 233333333334444444444555555555566901234567890123456789012345678901

[0612] The amino acid sequences of thrombin (Accession No. P25116),rhodopsin (Accession No. P08100), m1ACh (Accession No. P08482), IL-8A(Accession No. P25024), octopamine (Accession No. P22270), can be foundas SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95,respectively. Accordingly, GPCR-like proteins such as the SLGP proteinsof the present invention contain a significant number of structuralcharacteristics of the GPCR family. For instance, the SLGPs of thepresent invention contain conserved cysteines found in the first twoloops (prior to the third and fifth transmembrane domains) of most GPCRs(cys490 and cys562 of SEQ ID NO:89). A highly conserved asparagineresidue is present (asn125 in SEQ ID NO:89). SLGP proteins contains ahighly conserved leucine (leu154 of SEQ ID NO:89). The two cysteineresidues are believed to form a disulfide bond that stabilizes thefunctional protein structure. A highly conserved asparagine and argininein the fourth transmembrane domain of the SLGP proteins is present(asp158 and arg218 of SEQ ID NO:89). Moreover, a highly conservedproline is present (pro307 of SEQ ID NO:89). Proline residues in thefourth, fifth, sixth, and seventh transmembrane domains are thought tointroduce kinks in the alpha-helices and may be important in theformation of the ligand binding pocket. Moreover, a conserved tyrosineis present in the seventh transmembrane domain of SLGP-2 (tyr647 of SEQID NO:89).

[0613] In one embodiment, the SLGP proteins of the present inventioncontain at least one, two, three, four, five, six, or preferably, seventransmembrane domains. As used herein, the term “transmembrane domain”includes an amino acid sequence of about 15-40 amino acid residues inlength, more preferably, about 15-30 amino acid residues in length, andmost preferably about 18-25 amino acid residues in length, which spansthe plasma membrane. Transmembrane domains are rich in hydrophobicresidues, and typically have an α-helical structure. In a preferredembodiment, at least 50%, 60%, 70%, 80%, 90%, 95% or more of the aminoacids of a transmembrane domain are hydrophobic, e.g., leucines,isoleucines, tyrosines, or tryptophans. Transmembrane domains aredescribed in, for example, Zagotta W. N. et al, (1996) Annual Rev.Neuronsci. 19: 235-63, the contents of which are incorporated herein byreference. In a preferred embodiment, an SLGP protein of the presentinvention has more than one transmembrane domain, preferably 2, 3, 4, 5,6, or 7 transmembrane domains. For example, transmembrane domains can befound at about amino acids 433-452, 465-481, 500-524, 533-553, 570-594,619-635, and 642-666 of SEQ ID NO:89. In a particularly preferredembodiment, an SLGP protein of the present invention has 7 transmembranedomains.

[0614] In another embodiment, an SLGP is identified based on thepresence of at least one Loop domain, also referred to herein as a‘loop’. As defined herein, the term “loop” includes an amino acidsequence having a length of at least about 4, preferably about 5-10,preferably about 10-20, and more preferably about 20-30, 30-40, 40-50,50-60, 60-70, 70-80, 80-90, 90-1100, or 100-150 amino acid residues, andhas an amino acid sequence that connects two transmembrane domainswithin a protein or polypeptide. Such loop regions may be located eitherextracellularly or in the cytoplasm. Accordingly, the N-terminal aminoacid of a loop is adjacent to a C-terminal amino acid of a transmembranedomain in a naturally-occurring SLGP or SLGP-like molecule, and theC-terminal amino acid of a loop is adjacent to an N-terminal amino acidof a transmembrane domain in a naturally-occurring SLGP or SLGP-likemolecule.

[0615] As used herein, a “cytoplasmic loop” includes an amino acidsequence located within a cell or within the cytoplasm of a cell. Alsoas used herein, an “extracellular loop” includes an amino acid sequencelocated outside of a cell, or extracellularly. For example, loop domainscan be found at about amino acid residues 453-464, 482-499, 525-532,554-569, 595-618, and 636-641 of SEQ ID NO:89.

[0616] In another embodiment of the invention, an SLGP is identifiedbased on the presence of a “C-terminal domain”, also referred to hereinas a C-terminal tail, in the sequence of the protein. As used herein, a“C-terminal domain” includes an amino acid sequence having a length ofat least about 10, preferably about 10-25, more preferably about 25-50,more preferably about 50-75, even more preferably about 75-100, 100-150,150-200, 200-250, 250-300, 300-400, 400-500, or 500-600 amino acidresidues and is located within a cell or extracellularly. Accordingly,the N-terminal amino acid residue of a “C-terminal domain” is adjacentto a C-terminal amino acid residue of a transmembrane domain in anaturally-occurring SLGP or SLGP-like protein. For example, a C-terminaldomain is found at about amino acid residues 667-690 of SEQ ID NO:89.

[0617] In another embodiment, an SLGP is identified based on thepresence of an “N-terminal domain”, also referred to herein as anN-terminal loop in the amino acid sequence of the protein. As usedherein, an “N-terminal domain” includes an amino acid sequence havingabout 1-500, preferably about 1-400, more preferably about 1-300, morepreferably about 1-200, even more preferably about 1-100, and even morepreferably about 1-50, 1-25, or 1-10 amino acid residues in length andis located outside of a cell orintracellularly. The C-terminal aminoacid residue of a “N-terminal domain” is adjacent to an N-terminal aminoacid residue of a transmembrane domain in a naturally-occurring SLGP orSLGP-like protein. For example, an N-terminal domain is found at aboutamino acid residues 1-432 of SEQ ID NO:89.

[0618] Accordingly in one embodiment of the invention, an SLGP includesat least one, preferably 6 or 7, transmembrane domains and and/or atleast one loop. In another embodiment, the SLGP further includes anN-terminal domain and/or a C-terminal domain. In another embodiment, theSLGP can include six transmembrane domains, three cytoplasmic loops, andtwo extracellular loops, or can include six transmembrane domains, threeextracellular loops, and 2 cytoplasmic loops. The former embodiment canfurther include an N-terminal domain. The latter embodiment can furtherinclude a C-terminal domain. In another embodiment, the SLGP can includeseven transmembrane domains, three cytoplasmic loops, and threeextracellular loops and can further include an N-terminal domain or aC-terminal domain.

[0619] In another embodiment, an SLGP is identified based on thepresence of at least one “7 transmembrane receptor profile”, alsoreferred to as a “Secretin family sequence profile”, in the protein orcorresponding nucleic acid molecule. As used herein, the term “7transmembrane receptor profile” includes an amino acid sequence havingat least about 50350, preferably about 100-300, more preferably about150-275 amino acid residues, or at least about 200-258 amino acids inlength and having a bit score for the alignment of the sequence to the7tm_(—)1 family Hidden Markov Model (HMM) of at least 20, preferably20-30, more preferably 30-40, more preferably 40-50, or 50-75 orgreater. The 7tm_(—)1 family HMM has been assigned the PFAM AccessionPF00001.

[0620] To identify the presence of a 7 transmembrane receptor profile inan SLGP, the amino acid sequence of the protein is searched against adatabase of HMMs (e.g., the Pfam database, release 2.1) using thedefault parameters. For example, the hmmsf program, which is availableas part of the HMMER package of search programs, is a family specificdefault program for PF00001 and a score of 15 is the default thresholdscore for determining a hit. For example, a search using the amino acidsequence of SEQ ID NO:89 was performed against the HMM databaseresulting in the identification of a 7 TM receptor profile in the aminoacid sequence of SEQ ID NO:89. The results of the search are set forthbelow. Score: 56.37 Seq: 421 678 Model: 75 348*ksYYyvvYiIYTVGYSMSiaaLlvAMfIFcfFRrLHCtRNYIHMNMFms+++Y+++  I  +G  +S++ L + +F F FF  +  TR +IH+N+  S SLGP 421IKDYNILTRITQLGIIISLICLAICIFTFWFFSEIQSTRTTIHKNLCCS 469FILRaisWFIkDWvlyWmYsndeltwHCwMsivwCRivMfFMQYMMMtNY  L A  +F++        +N            +C I     +Y+ ++ + SLGP 470LFL-AELVFLVGINT---NTNKL----------FCSIIAGLLHYFFLAAF 505FWMLvEGvYLHTLIvMtFFsERqYFWWYylIGWGfPlVFitiWvItRcyY WM +EG+ L+  +V      +   + +Y++G  +P+V ++  +   + Y SLGP 506AWMCIEGIHLYLIVVGVIYNKGFLHKNFYIFGYLSPAVVVGFSAALGYRY 555ENt..nCWDmNDnMwyWWIIrgPIMlsIvVNFFFFINIIRILMtKLRepq+ T   CW++++N ++ W  +GP  L I+ N++ F  II+ + + SLGP 556YGTTKVCWLSTEN-NFIWSFIGPACLIILGNLLAFGVIIYKVFRHTAGLK 604MgEndMqqYWRlvKSTLlLIPLFGIHYMVFaWrPdNhwlwqIYMYFElsl   +        + +   L  L+  +  +F  +      +++  Y+  + SLGP 605PEVSCF--ENIRSCARGALALLLLGTTWIFGGLHVV-HASVVTAYLFTVS 651iSFQGFFVAiIYCFcNhEVQmEIRRrW* + FQG+F   + C + +  Q+E  R SLGP 652NAFQGMFIFLFLCVLSRKIQEEYYRLF 678

[0621] Accordingly, in one embodiment of the invention, an SLGP proteinis a human SLGP protein having a 7 transmembrane receptor profile atabout amino acids 421-678 of SEQ ID NO:89. Such a 7 transmembranereceptor profile has the amino acid sequence:IKDYNILTRITQLGIIISLICLAICIFTFWFFSEIQSTRTTIHKNLCCSLFLAE (SEQ ID NO: 96)LVFLVGINTNTNKLFCSIIAGLLHYFFLAAFAWMCIEGIHLYLIVVGVIYNKGFLHKNFYIFGYLSPAVVVGFSAALGYRYYGTTKVCWLSTENNFIWSFIGPACLIILGNLLAFGVIIYKVFRHTAGLKPEVSCFENIRSCARGALALLLLGTTWIFGGLHVVHASVVTAYLFTVSNAFQGMFIFLFLCVLSRKIQEEYYRLF

[0622] Accordingly, SLGP proteins having at least 20-30%, 30-49%,40-50%, 50-60% homology, preferably about 60-70%, more preferably about70-80%, or about 80-90% homology with the 7 transmembrane receptorprofile of human SLGP (e.g., SEQ ID NO:89) are within the scope of theinvention.

[0623] In another embodiment, an SLGP is identified based on thepresence of a “EGF-like domain” in the protein or corresponding nucleicacid molecule. As used herein, the term “EGF-like domain” includes aprotein domain having an amino acid sequence of about 55-90, preferablyabout 60-85, more preferably about 65-80 amino acid residues, or about70-79 amino acids and having a bit score for the alignment of thesequence to the EGF-like domain (HMM) of at least 6, preferably 7-10,more preferably 10-30, more preferably 30-50, even more preferably50-75, 75-100, 100-200 or greater. The EGF-like domain HMM has beenassigned the PFAM Accession PF00008. Preferably, one or more cysteineresidues in the EGF-like domain are conserved among SLGP family membersor other proteins containing EGF-like domains (i.e., located in the sameor similar position as the cysteine residues in other SLGP familymembers or other proteins containing EGF-like domains). In a preferredembodiment, an “EGF-like domain” has the consensus sequenceX(4)-C-X(0,48)-C-X(3,12)-C-X(1,70)-C-X(1,6)-C-X(2)-G-a-X(0,21)-G-X(2)-C-X,(where C=conserved cysteine involved in a disulfide bond, G=oftenconserved glycine, a=often conserved aromatic acid, X=any residue);corresponding to SEQ ID NO:97. In another preferred embodiment, an“EGF-like domain” has the consensus sequence C—X—C—X(5)-G-X(2)—C, the 3C's are involved in disulfide bonds; corresponding to SEQ ID NO:98. Inanother preferred embodiment, an “EGF-like domain” has the consensussequence C—X-C-X(2)-[GP]-[FYW]-X(4,8)—C, the three C's are involved indisulfide bonds; corresponding to SEQ ID NO:99.

[0624] To identify the presence of an EGF-like domain in an SLGPprotein, make the determination that a protein of interest has aparticular profile, the amino acid sequence of the protein is searchedagainst a database of HMMs (e.g., the Pfam database, release 2.1) usingthe default parameters. For example, the hmmsf program, which isavailable as part of the HMMER package of search programs, is a familyspecific default program for PF00008 and a score of 15 is the defaultthreshold score for determining a hit. Alternatively, the thresholdscore for determining a hit can be lowered (e.g., to 8 bits). Adescription of the Pfam database can be found in Sonhammer et al. (1997)Proteins 28(3)405-420 and a detailed description of HMMs can be found,for example, in Gribskov et al. (1990) Meth. Enzymol. 183:146-159;Gribskov et al. (1987) Proc. Natl. Acad. Sci. USA 84:4355-4358; Krogh etal. (1994) J. Mol. Biol. 235:1501-1531; and Stultz et al. (1993) ProteinSci. 2:305-314, the contents of which are incorporated herein byreference. A search was performed against the HMM database resulting inthe identification of an EGF-like domain in the amino acid sequence ofSEQ ID NO:89. The results of the search, indicating that such a domainis found at residues 22 through 100 of SEQ ID NO:89, are set forthbelow: Score: 6.16 Seq: 22 53 Model: 1 34*CnpNPCmNgGtCvNtp.mYtCiCpeGYmyYtGrrC* C+ +PC+ +++C+       C C +G   ++GSLGP 22 CTKTPCLPNAKCEIRNGIEACYCNMG---FSGNGV 53 Score: 18.87 Seq: 62 100Model: 1 34 *CnpN..PCmNgGtCvNtp.mYtCiCpeGYm.y.YtGrrC*C ++   C +++ C+NT+ +Y+C C +G++ +  + R+ SLGP 62CGNLTQSCGENANCTNTEGSYYCMCVPGFRSSSNQDRFI 100

[0625] All amino acids are described using universal single letterabbreviations according to these motifs.

[0626] Such an EGF-like domain has the following amino acid sequence:

[0627]CTKTPCLPNAKCEIRNGIEACYCNMGFSGNGVCGNLTQSCGENANCTNTEGSYYCMCVPGFRSSSNQDRFI(SEQ ID NO:100)

[0628] Accordingly, SLGP proteins having at least 50-60% homology,preferably about 60-70%, more preferably about 70-80%, or about 80-90%homology with an EGF-like domain of human SLGP (e.g., SEQ ID NO:100) arewithin the scope of the invention.

[0629] In another embodiment, an SLGP is identified based on thepresence of a “NADH-ubiquinone/plastoquinone oxidoreductase chain 4Ldomain” in the protein or corresponding nucleic acid molecule. As usedherein, the term “NADH-ubiquinone/plastoquinone oxidoreductase chain 4Ldomain” includes a protein domain having an amino acid sequence of about25-55, preferably about 30-50, more preferably about 35-45 amino acidresidues, or about 40-43 amino acids and having a bit score for thealignment of the sequence to the NADH-ubiquinone/plastoquinoneoxidoreductase chain 4L domain (HMM) of at least 6, preferably 7-10,more preferably 10-30, more preferably 3050, even more preferably 50-75,75-100, 100-200 or greater. The NADH-ubiquinone/plastoquinoneoxidoreductase chain 4L domain HMM has been assigned the PFAM AccessionPF00420.

[0630] To identify the presence of a NADH-ubiquinone/plastoquinoneoxidoreductase chain 4L domain in an SLGP protein, make thedetermination that a protein of interest has a particular profile, theamino acid sequence of the protein is searched against a database ofHMMs (e.g., the Pfam database, release 2.1) using the defaultparameters. For example, the hmmsf program, which is available as partof the HMMER package of search programs, is a family specific defaultprogram for PF00420 and a score of 15 is the default threshold score fordetermining a hit. Alternatively, the threshold score for determining ahit can be lowered (e.g., to 8 bits). A description of the Pfam databasecan be found in Sonhammer et al. (1997) Proteins 28(3)405-420 and adetailed description of HMMs can be found, for example, in Gribskov etal. (1990) Meth. Enzymol. 183:146-159; Gribskov et al. (1987) Proc.Natl. Acad. Sci. USA 84:4355-4358; Krogh et al. (1994) J. Mol. Biol.235:1501-1531; and Stultz et al. (1993) Protein Sci. 2:305-314, thecontents of which are incorporated herein by reference. A search wasperformed against the HMM database resulting in the identification of aNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain in theamino acid sequence of SEQ ID NO:89. The results of the search,indicating that such a domain is found at residues 475 through 517 ofSEQ ID NO:89, are set forth below. Score: 6.77 Seq: 475 517 Model: 1 43*MMMMthYHFiIMIaFmmGIMGIlMNRsHmMSMLMCLEmMMLSl*   ++ + ++   +F+  I G+L +     ++ MC+E++ L L SLGP 475LVFLVGINTNTNKLFCSIIAGLLHYFFLAAFAWMCIEGIHLYL 517

[0631] All amino acids are described using universal single letterabbreviations according to these motifs.

[0632] Such a NADH-ubiquinone/plastoquinone oxidoreductase chain 4Ldomain has the amino acid sequence:

[0633] LVFLVGINTNTNKLFCSIIAGLLHYFFLAAFAWMCIEGIHLYL(SEQ ID NO:101)

[0634] Accordingly, SLGP proteins having at least 50-60% homology,preferably about 60-70%, more preferably about 70-80%, or about 80-90%homology with a NADH-ubiquinone/plastoquinone oxidoreductase chain 4Ldomain of human SLGP (e.g., SEQ ID NO:101) are within the scope of theinvention.

[0635] In another embodiment, an SLGP protein includes at least anEGF-like domain. In another embodiment, an SLGP protein includes atleast an NADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain.In another embodiment, an SLGP protein includes at least a 7transmembrane receptor profile. In another embodiment, an SLGP proteinincludes an EGF-like domain, and an NADH-ubiquinone/plastoquinoneoxidoreductase chain 4L domain. In another embodiment, an SLGP proteinincludes an EGF-like domain and a 7 transmembrane receptor profile. Inanother embodiment, an SLGP protein includes an EGF-like domain, and anNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain, and a 7transmembrane receptor profile.

[0636] In another embodiment, an SLGP protein includes anNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain and a 7transmembrane receptor profile. In another embodiment, an SLGP proteinis human SLGP which includes an EGF-like domain having about amino acids22-100 of SEQ ID NO:89. In another embodiment, an SLGP protein is humanSLGP which includes an NADH-ubiquinone/plastoquinone oxidoreductasechain 4L domain having about amino acids 475-517 of SEQ ID NO:89. Inanother embodiment, an SLGP protein is human SLGP which includes a 7transmembrane receptor profile having about amino acids 421-678 of SEQID NO:89.

[0637] In yet another embodiment, an SLGP protein is human SLGP whichincludes a an EGF-like domain having about amino acids 22-100 of SEQ IDNO:89, an NADH-ubiquinone/plastoquinone oxidoreductase chain 4L domainhaving about amino acids 475-517 of SEQ ID NO:89, and a 7 transmembranereceptor profile having about amino acids 421-678 of SEQ ID NO:89.

[0638] Preferred SLGP molecules of the present invention have an aminoacid sequence sufficiently homologous to the amino acid sequence of SEQID NO:89 or SEQ ID NO:105. As used herein, the term “sufficientlyhomologous” refers to a first amino acid or nucleotide sequence whichcontains a sufficient or minimum number of identical or equivalent(e.g., an amino acid residue which has a similar side chain) amino acidresidues or nucleotides to a second amino acid or nucleotide sequencesuch that the first and second amino acid or nucleotide sequences sharecommon structural domains and/or a common functional activity. Forexample, amino acid or nucleotide sequences which share commonstructural domains have at least about 50% homology, preferably 60%homology, more preferably 70%-80%, and even more preferably 90-95%homology across the amino acid sequences of the domains and contain atleast one and preferably two structural domains, are defined herein assufficiently homologous. Furthermore, amino acid or nucleotide sequenceswhich share at least 50%, preferably 60%, more preferably 70-80, or90-95% homology and share a common functional activity are definedherein as sufficiently homologous.

[0639] As used interchangeably herein, an “SLGP activity”, “biologicalactivity of SLGP” or “functional activity of SLGP”, refers to anactivity exerted by an SLGP protein, polypeptide or nucleic acidmolecule on an SLGP responsive cell as determined in vivo, or in vitro,according to standard techniques. In one embodiment, an SLGP activity isa direct activity, such as an association with a SLGP-target molecule.As used herein, a “target molecule” or “binding partner” is a moleculewith which an SLGP protein binds or interacts in nature, such thatSLGP-mediated function is achieved. An SLGP target molecule can be anon-SLGP molecule or an SLGP protein or polypeptide of the presentinvention. In an exemplary embodiment, an SLGP target molecule is anSLGP ligand. Alternatively, an SLGP activity is an indirect activity,such as a cellular signaling activity mediated by interaction of theSLGP protein with an SLGP ligand.

[0640] In a preferred embodiment, an SLGP activity is at least one ormore of the following activities: (i) interaction of an SLGP proteinwith soluble SLGP ligand (e.g., CD55); (ii) interaction of an SLGPprotein with a membrane-bound non-SLGP protein; (iii) interaction of anSLGP protein with an intracellular protein (e.g., an intracellularenzyme or signal transduction molecule); (iv) indirect interaction of anSLGP protein with an intracellular protein (e.g., a downstream signaltransduction molecule); and (v) modulation of cellular proliferation,growth, differentiation, or migration. In yet another preferredembodiment, an SLGP activity is at least one or more of the followingactivities: (1) modulation of cellular signal transduction, either invitro or in vivo; (2) regulation of activation in a cell expressing anSLGP protein exposure to alpha-latrotoxin); (3) regulation ofinflammation; or (4) modulation of angiogenesis (e.g., proliferation,elongation, and migration of endothelial cells (e.g. tumor endothelialcells), to form new vessels).

[0641] Accordingly, another embodiment of the invention featuresisolated SLGP proteins and polypeptides having an SLGP activity.Preferred SLGP proteins have at least one transmembrane domain and anSLGP activity. In a preferred embodiment, an SLGP protein has a 7transmembrane receptor profile and an SLGP activity. In anotherpreferred embodiment, an SLGP protein has an EGF-like domain and an SLGPactivity. In another preferred embodiment, an SLGP protein has anNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain and an SLGPactivity. In still another preferred embodiment, an SLGP protein has a 7transmembrane receptor profile, an EGF-like domain, and SLGP activity.In still another preferred embodiment, an SLGP protein has a 7transmembrane receptor profile, an EGF-like domain, and anNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain and an SLGPactivity. In still another preferred embodiment, an SLGP protein has a 7transmembrane receptor profile and an NADH-ubiquinone/plastoquinoneoxidoreductase chain 4L domain and an SLGP activity. In still anotherpreferred embodiment, an SLGP protein has an EGF-like domain and anNADH-ubiquinone/plastoquinone oxidoreductase chain 4L domain and an SLGPactivity. In still another preferred embodiment, an SLGP protein has a 7transmembrane receptor profile, an EGF-like domain, an SLGP activity,and an amino acid sequence sufficiently homologous to an amino acidsequence of SEQ ID NO:89 or SEQ ID NO:105.

[0642] An alignment of the amino acid sequences of human SLGP (SEQ IDNO:89) and human CD 97 (Accession No. U76764, SEQ ID NO:102) generatedutilizing the ALIGN program with the following parameter setting:PAM120, gap penalties: −12/−4 (Myers, E. and Miller, W. (1988) “OptimalAlignments in Linear Space” CABIOS 4:11-17) demonstrated a 27.9%identity between the two sequences.

[0643] An alignment of the nucleotide sequences of human SLGP (SEQ IDNO:88) and human CD 97 (Accession No. U76764, SEQ ID NO:103) generatedutilizing the ALIGN program with the following parameter setting:PAM120, gap penalties: −12/−4 (Myers, E. and Miller, W. (1988) “OptimalAlignments in Linear Space” CABIOS 4:11-17) demonstrated a 41.8%identity between the two sequences.

[0644] The nucleotide sequence of the isolated human SLGP cDNA and thepredicted amino acid sequence of the human SLGP polypeptide are shown inSEQ ID NOs:88 and 89, respectively.

[0645] The human SLGP cDNA, which is approximately 2987 nucleotides inlength (SEQ ID NO:88), encodes a protein which is approximately 690amino acid residues in length (SEQ ID NO:89).

[0646] The nucleotide sequence of the isolated mouse SLGP cDNA and thepredicted amino acid sequence of the mouse SLGP polypeptide are shown inSEQ ID NOs:104 and 105, respectively.

[0647] The mouse SLGP cDNA, which is approximately 3952 nucleotides inlength (SEQ ID NO:104), encodes a protein which is approximately 689amino acid residues in length (SEQ ID NO:105).

[0648] Plasmids containing the nucleotide sequence encoding human andmouse SLGP were deposited with the American Type Culture Collection(ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209, on ______and assigned Accession Numbers ______ and ______. This deposit will bemaintained under the terms of the Budapest Treaty on the InternationalRecognition of the Deposit of Microorganisms for the Purposes of PatentProcedure. These deposits were made merely as a convenience for those ofskill in the art and is not an admission that a deposit is requiredunder 35 U.S.C. §112. In accordance with 37 CFR 1.808(a), access to thedeposits will be available during pendancy of the instant application toone determined by the Commission to be entitled thereto under §1.14 and35 USC §122. The deposits will irrevocably and without restriction orcondition be released to the public upon grant of a patent on thisapplication.

[0649] Isolation of the Human and Mouse SLGP cDNAs

[0650] In order to identify novel secreted and/or membrane-boundproteins, a program termed ‘signal sequence trapping’ was utilized toanalyze the sequences of several cDNAs of a cDNA library derived frombronchial epithelial cells which had been stimulated with the cytokine,TNFα. This analysis identified a human clone having an insert ofapproximately 3 kb containing a protein-encoding sequence ofapproximately 2987 nucleotides capable of encoding approximately 690amino acids of SLGP (e.g., the starting methionine through residue 690of, for example, SEQ ID NO:89).

[0651] The nucleotide sequence encoding the human SLGP protein is setforth as SEQ ID NO:88. The full length protein encoded by this nucleicacid is comprised of about 690 amino acids and has the amino acidsequence set forth as SEQ ID NO:89. The coding portion (open readingframe) of SEQ ID NO:88 is set forth as SEQ ID NO:90.

[0652] The nucleotide sequence encoding the mouse SLGP protein is setforth as SEQ ID NO:104. The full length protein encoded by this nucleicacid is comprised of about 689 amino acids and has the amino acidsequence set forth as SEQ ID NO:105. The coding portion (open readingframe) of SEQ ID NO:104 is set forth as SEQ ID NO:106.

[0653] Analysis of Human SLGP

[0654] A BLAST search (Altschul et al. (1990) J. Mol. Biol. 215:403) ofthe nucleotide sequence of human SLGP has revealed that SLGP issignificantly similar to a protein identified as human CD 97 (AccessionNo. U76764; SEQ ID NO:102 ans 103) and to a protein identified as ratlatrophilin (Accession Nos. U78105, U72487).

[0655] The SLGP proteins of the present invention contain a significantnumber of structural characteristics of the GPCR family. For instance,the SLGPs of the present invention contain conserved cysteines found inthe first 2 loops (prior to the third and fifth transmembrane domains)of most GPCRs (cys490 and cys562 of SEQ ID NO:89). A highly conservedasparagine residue is present (asn125 in SEQ ID NO:89). SLGP proteinscontains a highly conserved leucine (leu154 of SEQ ID NO:89). The twocysteine residues are believed to form a disulfide bond that stabilizesthe functional protein structure. A highly conserved asparagine andarginine in the fourth transmembrane domain of the SLGP proteins ispresent (asp158 and arg218 of SEQ ID NO:89). Moreover, a highlyconserved proline is present (pro307 of SEQ ID NO:89). Proline residuesin the fourth, fifth, sixth, and seventh transmembrane domains arethought to introduce kinks in the alpha-helices and may be important inthe formation of the ligand binding pocket. Moreover, a conservedtyrosine is present in the seventh transmembrane domain of SLGP-2(tyr647 of SEQ ID NO:89).

[0656] As such, the SLGP family of proteins, like the Secretin family ofproteins, are referred to herein as G protein-coupled receptor-likeproteins.

[0657] SLGP is predicted to contain the following sites: N-glycosylationsite at residues 15-18, residues 21-24, residues 64-67, residues 74-77,residues 127-130, residues 177-180, residues 188-191, residues 249-252,residues 381-384, and at residues 395-398 of SEQ ID NO:89;Glycosaminoglycan attachment site at residues 49-52 of SEQ ID NO:89;cAMP- and cGMP-dependent protein kinase phosphorylation sites atresidues 360-363 of SEQ ID NO:89; Protein kinase C phosphorylation sitesat residues 135-137, residues 181-183, residues 233-235, residues358-360, residues 363-365, residues 400-402, residues 457-459, residues485-487, residues 558-560, and residues 667-669 of SEQ ID NO:89; Caseinkinase II phosphorylation sites at residues 54-57, residues 68-71,residues 76-79, residues 94-97, residues 135-138, residues 150-153,residues 155-158, residues 161-164, residues 181-184, residues 190-193,residues 244-247, residues 310-313, residues 325-328, residues 346-349,and at residues 608-611 of SEQ ID NO:89; Tyrosine kinase phosphorylationsite at residues 36-43, and residues 668-675 of SEQ ID NO:89;N-myristoylation sites at residues 38-43, residues 50-55, residues80-85, residues 382-387, residues 388-393, residues 434-439, residues480-485, residues 521-526, residues 584-589, and at residues 619-624 ofSEQ ID NO:89; Aspartic acid and asparagine hydroxylation at residues75-86 of SEQ ID NO:89, EF-hand calcium-binding domain at residues153-165 of SEQ ID NO:89.

[0658] Tissue Distribution of SLGP mRNA by Northern Blot Hybridization

[0659] This Example describes the tissue distribution of SLGP mRNA, asdetermined by Northern blot hybridization.

[0660] Northern blot hybridizations with the various RNA samples wereperformed (Clontech Human Multi-tissue Northern I and a human normal anddiseased heart tissue northern) under standard conditions and washedunder stringent conditions. A 3.2 Kb and a 4.2 Kb mRNA transcript wasdetected in all tissues tested (heart, brain, placenta, lung, liver,skeletal muscle, kidney, pancreas), with the highest expression inheart. Specifically, the expression was found to be localized toendothelial cells in the heart. Additionally, these transcripts werefound in both normal and diseased hearts.

[0661] Tissue Distribution Analysis of Human and Mouse SLGP cDNA

[0662] The following describes the tissue distribution of human andmouse SLGP cDNA, as determined using the TaqMan™ procedure.

[0663] The results from these analyses showed that Human SLGP isupregulated in tube forming Human Microvascular Endothelial Cells(HMVEC) and in proliferating HMVEC as compared to arresting HMVEC. HumanSLGP is also upregulated in glioblastomas as compared to normal brain.

[0664] Additionally, mouse SLGP was shown to be upregulated inVEGF-induced angiogenic xenograft plugs as compared to parental plugs.

[0665] In Situ Hybridization Analysis of Human SLGP

[0666] The following describes the tissue distribution of human SLGP asdetermined using in situ hybridization analysis. For in situ analysis,tissues, e.g. brain and glioblastoma tissues, were first frozen on dryice.

[0667] In situ hybridization results show that the human SLGP gene isexpressed in endothelial cells of glioblastomas but not in endothelialcells of normal brains.

[0668] Analysis of Human and Mouse SLGP Expression

[0669] The following describes the expression of human and mouse SLGP asdetermined by transcriptional profiling experiments. Expression of humanSLGP in proliferating HMVEC and arresting HMVEC was analyzed bytranscriptional profiling. The results from this analysis demonstratethat human SLGP is up-regulated in proliferating HMVEC as compared toarresting HMVEC.

[0670] Expression of mouse SLGP in VEGF-induced angiogenic plugs andparental xenografts was also analyzed by transcriptional profiling.These resuts demonstrated that mouse SLGP expression is up-regulated inVEGF-induced angiogenic xenograft plugs as compared to parentalxenografts.

[0671] Human 38555 and 593

[0672] The present invention is based, at least in part, on thediscovery of human cDNA molecules which encode proteins which are hereindesignated 38555 (or 38555) and 593. The invention is also based on thediscovery that the protein encoded by a previously described (butotherwise non-characterized) human brain cDNA clone is, or isfunctionally analogous to, a prostaglandin and thromboxane transmembranetransport protein. These three proteins are integral membrane proteinsthat facilitate transmembrane transport of charged organic compoundssuch as one or more of prostaglandins, thromboxanes, hexoses,disaccharides, hormones (e.g. insulin), peptides, neurotransmitters,cytokines, chemokines, and the like. The characteristics of each ofthese proteins and the cDNAs encoding them are now described separately.

[0673] Protein 38555

[0674] A cDNA encoding at least a portion of human 38555 protein wasisolated from a library of human cDNA clones on the basis of homology tothe amino terminal portion of the protein designated ‘humanprostaglandin transporter’ (HPT) in the literature (U.S. Pat. No.5,792,851; Lu et al. (1996) J. Clin. Invest. 98:1142-1149; Kanai et al.(1995) Science 268:866-869). Human protein 38555 is predicted bystructural analysis to be a transmembrane transporter protein havingtwelve transmembrane domains.

[0675] The full length of the cDNA encoding human protein 38555 (SEQ IDNO:107) is 2563 nucleotide residues. The ORF of this cDNA, nucleotideresidues 42 to 1970 of SEQ ID NO:107 (i.e. SEQ ID NO:109), encodes a643-amino acid protein (SEQ ID NO:108) which exhibits amino acidsequence homology with HPT protein and other prostaglandin transporters.The human 38555 genomic sequence is shown as nucleotide residues1-50,000 in SEQ ID NO:110 and nucleotide residues 50,001-31,124 in SEQID NO:118. The gene encoding human protein 38555 maps to humanchromosome 15 at q26.1. A PAC clone including this region has beensequenced, and the sequence of that clone is listed in GenBank Accessionnumber AC005319. It was not previously recognized that any protein, letalone protein 38555 was encoded within the portion of the genomeencompassed by the PAC clone. The exon and intron structure of thegenomic sequence is described in Tables 49 and 50. Table 49 lists thepositions of exons in this sequence, and Table 50 lists intron positionsand branch sites (bold residues in Table 50 indicate RNA splicingjunctions. TABLE 49 Corresponding Amino Position within Position withinAcid Sequence Exon SEQ ID SEQ ID NO: (Residues of Designation NO: 107110/118 SEQ ID NO: 109) a 541-639 3683-3781 168-199 b 640-90313078-13341 200-287 c  904-1068 29276-29440 288-342 d 1069-126734872-35070 343-408 e 1268-1406 37163-37301 409-455 f 1407-158255668-55843 456-513 g 1583-1647 59634-59698 514-535 h 1648-189071440-71682 536-616 i 1891-2546 80469-81124 617-643

[0676] TABLE 50 Position Intron in SEQ Donor Branch Desig- ID NO: SiteAcceptor Site Site(s) nation 110/118 Sequence Sequence (TACTAAC) i   0-3682 TCAG ii  3782-13077 GTAA ACAG 7141-7147 iii 13342-29275 GTAAGCAG iv 29441-34871 GTGA CCAG v 35071-37162 GTGA CCAG vi 37302-55667GTAA TCAG  39794-39800, 52196-52202 vii 55844-59633 GTAA GTAG viii59699-71439 GTAT ACAG ix 71683-80468 GTGA TTAG

[0677] In addition to full length human protein 38555, the inventionincludes fragments, derivatives, and variants of protein 38555, asdescribed herein. These proteins, fragments, derivatives, and variantsare collectively referred to herein as polypeptides of the invention orproteins of the invention.

[0678] The invention also includes nucleic acid molecules which encode apolypeptide of the invention. Such nucleic acids include, for example, aDNA molecule having the nucleotide sequence listed in SEQ ID NO:107 orsome portion thereof, such as the portion which encodes human protein38555, or a domain, fragment, derivative, or variant of protein 38555.These nucleic acids are collectively referred to as nucleic acids of theinvention.

[0679] 38555 proteins of the invention and nucleic acid moleculesencoding them comprise a family of molecules having certain conservedstructural and functional features, as indicated by the conservation ofamino acid sequence between protein 38555 and HPT (SEQ ID NO:116), thehuman OatP sodium-independent organic anion transporter protein (GenBankAccession no. P46721; SEQ ID NO:117), human KIAA0880 protein (GenBankAccession no. 4240248; SEQ ID NO:115), and human protein 593 (asdescribed herein, SEQ ID NO:113).

[0680] 38555 proteins typically comprise a variety of potentialpost-translational modification sites (often within an extracellulardomain), such as those described herein in Table 51, as predicted bycomputerized sequence analysis of human 38555 protein using amino acidsequence comparison software (comparing the amino acid sequence ofprotein 38555 with the information in the PROSITE database {rel. 12.2;Feb, 1995} and the Hidden Markov Models database {Rel. PFAM 3.3}). Incertain embodiments, a protein of the invention has at least 1, 2, 4, 6,8, 10, 15, or 20 or more of the post-translational modification siteslisted in Table 51. TABLE 51 Type of Potential Modification Site AminoAcid Residues of Amino Acid or Domain SEQ ID NO: 108 SequenceN-glycosylation site 104 to 107 NGSG 120 to 123 NRTA 332 to 335 NLTT 408to 41  NSTA 453 to 456 NSTN 470 to 473 NATV cAMP- or cGMP-dependentprotein 159 to 162 RKDS kinase phosphorylation site 362 to 365 KKLSProtein kinase C phosphorylation site 256 to 258 SER 625 to 627 TEKCasein kinase II phosphorylation site 16 to 19 TTLE 34 to 37 SSFE 106 to109 SGGD 151 to 154 SYID 200 to 203 SNLD 205 to 208 TPDD 256 to 259 SERE414 to 417 SALD 616 to 619 TSTE 628 to 631 TCPE 634 to 637 SPSE Tyrosinekinase phosphorylation site 158 to 165 RRKDSSLY N-myristoylation site 30to 35 GVIASS 64 to 69 GIVMAL 70 to 75 GALLSA 167 to 172 GILFTM 184 to189 GSFCTK 213 to 218 GAWWGG 353 to 358 GIFLGG 451 to 456 GCNSTN 482 to487 GCQEAF 547 to 552 GIDSTC 612 to 617 GGLSTS Sugar (or other)transport domain   2 to 446 Kazal domain 426 to 460

[0681] Protein 38555 comprises domains which exhibit homology with knownsugar (or other) transport domains and with Kazal domains. In oneembodiment, the protein of the invention has at least one domain that isat least 55%, preferably at least about 65%, more preferably at leastabout 75%, yet more preferably at least about 85%, and most preferablyat least about 95% identical to one of these domains. Preferably, theprotein of the invention has at least two domains, each of which is atleast 55%, preferably at least about 65%, more preferably at least about75%, yet more preferably at least about 85%, and most preferably atleast about 95% identical to either the sugar (or other) transportdomain or the Kazal domain of protein 38555.

[0682] Sugar (or other) transport domains occur in a variety of proteinsinvolved in transmembrane transport of sugars and other metabolites.Other proteins which comprise such a domain include human glucosetransporters GLUT1, GLUT2, GLUT3, GLUT4, GLUT5, GLUT6, and GLUT7,Escherichia coli proteins AraE (arabinose-proton symporter), GalP(galactose-proton symporter), citrate-proton symport protein, KgtP(α-ketoglutarate permease), ProP (proline/betaine transporter), and XylE(xylose-proton symporter), Escherichia coli hypothetical proteins YabE,YdjE, and YhjE, Klebsiella pneumoniae citrate-proton symport protein,Zymomonas mobilis glucose facilitated diffusion protein, yeast high andlow affinity glucose transport proteins (SNF3 and HXT1 through HXT14),yeast galactose transporter, yeast maltose permease, yeast myo-inositoltransporter, yeast carboxylic acid transporter homolog JEN1, yeasthypothetical proteins YBR241c, YCR98c, and YFL040w, Klyveromyces lactislactose permease, Neurospora crassa quinate transporter, Emericellanidulans quinate permease, Chlorella hexose carrier, Arabidopsisthaliana glucose transporter, spinach sucrose transporter, Leishmaniadonovani transporters D1 and D2, Leishmania enriettii probable transportprotein LTP, Caenorhabditis elegans hypothetical protein ZK637.1,Haemophilus influenzae hypothetical proteins H10281 and H10418, andBacillus subtilis hypothetical proteins YxbC and YxdF. Occurrence of asugar (or other) transport domain in protein 38555 indicates thatprotein 38555 is involved in transmembrane transport of one or morecompounds, most likely a compound having a molecular weight on the orderof a hexose or greater (i.e. having a molecular weight greater thanabout 180). Examples of such compounds include prostaglandins,thromboxanes, hexoses, disaccharides, hormones (e.g. insulin), peptides,neurotransmitters, cytokines, chemokines, and the like. Protein 38555thus mediates one or more of facilitated diffusion and symport orantiport (e.g. involving co-transport of a proton, a sodium ion, apotassium ion, or another physiological ion).

[0683] Kazal domains occur frequently in serine protease inhibitors.However, these domains also occur as extracellular domains in agrins,which are not thought to have roles as protease inhibitors. Thesedomains are characterized by occurrence, preferably within anextracellular domain, of the consensus pattern

[0684] C-X_((7 or 8))-C-X₆-Y-X₃-C-X_((2 or 3))-C-(SEQ ID NO:119)

[0685] wherein standard single-letter amino acid residue codes are used,X being any amino acid residue, and subscripts referring to the numberof residues. Agrins are involved in organization of neural synapses,including, for example, inter-neuronal synapses within the centralnervous system (e.g. glutamatergic synapses) and neuromuscular junctions(Martin and Sanes (1997) Development 124:3909-3917; Lieth and Fallon(1993) J. Neurosci. 13:2509-2514). Agrins are also involved inorganization of endothelial cells and astrocytes during formation andmaintenance of the blood brain barrier. Thus, occurrence of a Kazaldomain in protein 38555 indicates that this protein is involved information and maintenance of cell-to-cell interactions, and moreparticularly that the protein is involved in forming and maintainingneural synapses, including both neuron-to-neuron synapses andneuron-to-non-neural cell synapses (e.g. neuromotor and neuroendocrinesynapses).

[0686] Human protein 38555 exhibits sequence similarity to HPT (GenBankAccession no. Q92959). An alignment of the amino acid sequences of humanprotein 38555 (SEQ ID NO:108) and HPT (SEQ ID NO:116) made using theALIGN program of the GCG software package, pam120.mat scoring matrix,gap penalties −12/−4, demonstrates that the amino acid sequences of theproteins are 32.4% identical.

[0687] Protein 38555 is predicted by computerized amino acid sequenceanalysis (using the MEMSAT computer program) to be atwelve-transmembrane region integral membrane protein havingtransmembrane regions at approximately the following positions withinSEQ ID NO:108: from about amino acid residue 8 to about residue 17; fromabout amino acid residue 29 to about residue 52; from about amino acidresidue 59 to about residue 76; from about amino acid residue 129 toabout residue 153; from about amino acid residue 164 to about residue186; from about amino acid residue 215 to about residue 236; from aboutamino acid residue 301 to about residue 324; from about amino acidresidue 341 to about residue 361; from about amino acid residue 374 toabout residue 392; from about amino acid residue 490 to about residue513; from about amino acid residue 524 to about residue 548; and fromabout amino acid residue 575 to about residue 592.

[0688] Extracellular domains are predicted to include approximatelyamino acid residues 18 to 28, 77 to 128, 187 to 214, 325 to 340, 393 to489, and 549 to 574 of SEQ ID NO:108. Intracellular domains arepredicted to include approximately amino acid residues 1 to 7, 53 to 58,154 to 163, 237 to 300, 362 to 373, 514 to 523, and 593 to 643 of SEQ IDNO:108.

[0689] Human protein 38555 can have additional amino acid residues atthe amino terminal end of the sequence listed in SEQ ID NO:108 (i.e. theprotein can have an additional portion at its amino terminus). Forexample, protein 38555 can have 1, 2, 4, 6, 10, 15, 20, 25, or 30 ormore additional amino acid residues at the amino terminus indicated inSEQ ID NO:108.

[0690] As described elsewhere herein, relatively hydrophilic regions aregenerally located at or near the surface of a protein, and are morefrequently effective immunogenic epitopes than are relativelyhydrophobic regions. For example, the region of human protein 38555 fromabout amino acid residue 415 to about amino acid residue 430 appears tobe located at or near the surface of the protein, while the region fromabout amino acid residue 440 to about amino acid residue 450 appears notto be located at or near the surface.

[0691] The predicted molecular weight of human protein 38555 is about69.2 kilodaltons.

[0692] A monkey cDNA clone having significant homology with the humancDNA clone encoding protein 38555 was isolated from a monkey brain cDNAlibrary, indicating that human protein 38555 is expressed in braintissue, although it can, of course, be expressed in other tissues aswell.

[0693] Biological Function of Human 38555 Proteins, Nucleic AcidsEncoding Them, and Modulators of These Molecules

[0694] Human 38555 proteins are involved in disorders which affect bothtissues in which they are normally expressed and tissues in which theyare normally not expressed. Based on the observation that 38555 proteinis expressed in monkey-brain and is therefore likely expressed in humanbrain tissue, human 38555 protein is involved in one or more biologicalprocesses which occur in brain and other neurological tissues. Inparticular, 38555 is involved in modulating growth, proliferation,survival, differentiation, and activity of cells including, but notlimited to, central nervous system neurons, peripheral nervous systemneurons, motor neurons, sensory neurons, and sympathetic andparasympathetic neural cells of the animal in which it is normallyexpressed. Protein 38555 is also involved in mediating interactionsbetween cells, particularly between two neurons or between a neuron anda non-neuronal cell such as a muscle or endocrine cell. Thus, 38555protein has a role in disorders which affect neuronal cells and cellswhich interact with neurons and their growth, proliferation, survival,differentiation, and activity.

[0695] Widespread expression of 38555 has been detected among humantissue types. Thus, the growth-, proliferation-, survival-,differentiation-, and activity-modulating activities of 38555 proteinaffect cells of many types. Thus, protein 38555 can affect cell-to-cellinteractions in a wide variety of cell types.

[0696] The presence of the sugar (or other) transport domain in protein38555 indicates that this protein is involved in transmembrane transportof one or more charged organic compounds such as prostaglandins,thromboxanes, neurotransmitters, hormones, small peptides, shortpolysaccharides (e.g. disaccharides), and the like. The proteins of theinvention are therefore involved in one or more disorders relating toinappropriate uptake or release of such molecules (i.e. includinginappropriate failure to take up or release such molecules). Protein38555 is thus involved in one or more of a variety of cellular uptakeand release disorders such as diabetes, nutritional disorders (e.g.vitamin deficiencies, and malnutrition), metabolic disorders (e.g.obesity, porphyrias, hyper- and hypolipoproteinemia, lipidoses, andwater, electrolyte, mineral, and acid/base imbalances), and neuraltransmission disorders (e.g. inappropriate pain, dementia, multiplesclerosis, nerve root disorders, Alzheimer's disease, Parkinson'sdisease, depression, physical and psychological substance addiction,sexual dysfunction, schizophrenic disorders, delusional disorders, mooddisorders, sleep disorders, and the like).

[0697] Occurrence of a Kazal domain in human protein 38555 furtherimplicates this protein in neuronal development and transmission. Thepresence of this domain therefore indicates that 38555 protein isinvolved in disorders relating to inappropriate formation (i.e.including failure to form) and maintenance (i.e. includingdeterioration) of neuronal synapses, including both neuron-to-neuronsynapses and neuron-to-non-neuronal cell synapses. Thus, in addition tothe neural transmission disorders described above, protein 38555 is alsoimplicated in disorders such as stroke, regeneration of chronically ortraumatically damaged neuronal structures (including nerve, brain, andspinal cord), developmental neuronal disorders (e.g. spina bifida),neuronal cancers (e.g. gliomas, astrocytomas, ependymomas, pituitaryadenomas, and the like), peripheral nerve deficit, cardiacinsufficiency, and the like.

[0698] The observation that human protein 38555 shares sequence homologywith proteins involved in transmembrane prostaglandin transportindicates that 38555 protein has activity identical or analogous to theactivity of those proteins, i.e. that 38555 catalyzes or facilitatestransmembrane transport of one or more prostaglandins, thromboxanes,other hormones or hormone-like molecules, or other charged organiccompounds. Exemplary molecules which can be transported across cellmembranes via protein 38555 include one or more charged organiccompounds such as prostaglandins A₁, A₂, B₁, B₂, D₂, E₁, E₂, F_(1α),F_(2α), G₂, H₂, I₂, and J₂ and thromboxanes A₂ and B₂. Uptake andrelease of prostaglandins and thromboxanes, for example, are known to beinvolved in a variety of physiological processes and disorders includingglaucoma, ovum fertilization, sperm motility, pregnancy, labor,delivery, abortion, gastric protection, peptic ulcer formation,intestinal fluid secretion, liver protection, liver damage, liverfibrosis, pain stimulation, glomerular filtration, maintenance of bodytemperature, fever, airway resistance, asthma, chronic obstructivepulmonary disorder, modulation of blood pressure, hypertension, shock,modulation of inflammation, platelet aggregation, abnormal bloodcoagulation, atherosclerosis, arteriosclerosis, and coronary arterydisease. Thus, polypeptides and nucleic acid molecules of the invention,and compounds which bind with or modulate one or more polypeptides andnucleic acid molecules of the invention can be used to prognosticate,diagnose, inhibit, or treat one or more of the disorders listed above orone or more disorders associated with the physiological processes listedabove.

[0699] Protein 593

[0700] A cDNA encoding at least a portion of human 593 protein wasidentified by assembling isolated sequences derived from a library ofhuman cDNA clones on the basis of homology with the nucleic acidsequence encoding human protein 38555. Human protein 593 is predicted bystructural analysis to be a transmembrane transporter protein havingtwelve transmembrane domains.

[0701] The full length of the cDNA encoding human protein 593 (SEQ IDNO:1 μl) is 2276 nucleotide residues. The ORF of this cDNA, nucleotideresidues 1 to 1836 of SEQ ID NO:111 (SEQ ID NO:113), encodes a 612-aminoacid protein (SEQ ID NO:112) which exhibits amino acid sequence homologywith human protein 38555 and other prostaglandin transporters.

[0702] In addition to full length human protein 593, the inventionincludes fragments, derivatives, and variants of protein 593, asdescribed herein. These proteins, fragments, derivatives, and variantsare collectively referred to herein as polypeptides of the invention orproteins of the invention.

[0703] The invention also includes nucleic acid molecules which encode apolypeptide of the invention. Such nucleic acids include, for example, aDNA molecule having the nucleotide sequence listed in SEQ ID NO:111 orsome portion thereof, such as the portion which encodes human protein593, or a domain, fragment, derivative, or variant of protein 593. Thesenucleic acids are collectively referred to as nucleic acids of theinvention.

[0704] Human 593 proteins of the invention and nucleic acid moleculesencoding them comprise a family of molecules having certain conservedstructural and functional features, as indicated by the close homologyof human protein 593 (SEQ ID NO:112) to HPT (SEQ ID NO:117), the humanOatP sodium-independent organic anion transporter protein (GenBankAccession no. P46721; SEQ ID NO:116), human KIAA0880 protein (GenBankAccession no. 4240248; SEQ ID NO:115), and human protein 38555 (asdescribed herein, SEQ ID NO:108).

[0705] Human 593 proteins typically comprise a variety of potentialpost-translational modification sites (often within an extracellulardomain), such as those described herein in Table 52, as predicted bycomputerized sequence analysis of human 593 protein using amino acidsequence comparison software (comparing the amino acid sequence ofprotein 593 with the information in the PROSITE database {rel. 12.2;Feb, 1995} and the Hidden Markov Models database {Rel. PFAM 3.3}). Incertain embodiments, a protein of the invention has at least 1, 2, 4, 6,8, 10, 15, or 20 or more of the post-translational modification siteslisted in Table 52. TABLE 52 Type of Potential Modification Site AminoAcid Residues of Amino Acid or Domain SEQ ID NO: 112 SequenceN-glycosylation site 389 to 392 NLTA 447 to 450 NLSS Protein kinase Cphosphorylation site 228 to 230 SQR 245 to 247 SSR 258 to 260 TIR 296 to298 SPK 492 to 494 TLR Casein kinase II phosphorylation site 19 to 22TSLE 37 to 40 SSYD 140 to 143 TYLD 246 to 249 SRGE 251 to 254 SNPD 258to 261 TIRD 307 to 310 SASE 430 to 433 TNVD 598 to 601 SAPD 602 to 605SATD Tyrosine kinase phosphorylation site 23 to 30 RRYDLHSYN-myristoylation site  7 to 12 GMTVNG 33 to 38 GLIASS 103 to 108 GAVCAD174 to 179 GALLNI 206 to 211 GSGAAA 282 to 287 GATEAT 323 to 328 GGGGTF373 to 378 GVTASY 423 to 428 GCPAAT 540 to 545 GQQGSC 588 to 593 GLETCLAmidation site 183 to 186 MGRR Aminotransferase class-V pyridoxal 52 to68 YFGGSGHKP- phosphate attachment site RWLGWGVL Sugar (or other)transport domain   2 to 490 Kazal domain  398 to 4441

[0706] Protein 593 comprises domains which exhibit homology with knownsugar (or other) transport domains and with Kazal domains. In oneembodiment, the protein of the invention has at least one domain that isat least 55%, preferably at least about 65%, more preferably at leastabout 75%, yet more preferably at least about 85%, and most preferablyat least about 95% identical to one of these domains. Preferably, theprotein of the invention has at least two domains, each of which is atleast 55%, preferably at least about 65%, more preferably at least about75%, yet more preferably at least about 85%, and most preferably atleast about 95% identical to either the sugar (or other) transportdomain or the Kazal domain of protein 593.

[0707] Sugar (or other) transport domains occur in a variety of proteinsinvolved in transmembrane transport of sugars and other metabolites.Other proteins which comprise such a domain include human glucosetransporters GLUT1, GLUT2, GLUT3, GLUT4, GLUT5, GLUT6, and GLUT7,Escherichia coli proteins AraE (arabinose-proton symporter), GalP(galactose-proton symporter), citrate-proton symport protein, KgtP(α-ketoglutarate permease), ProP (proline/betaine transporter), and XylE(xylose-proton symporter), Escherichia coli hypothetical proteins YabE,YdjE, and YhjE, Klebsiella pneumoniae citrate-proton symport protein,Zymomonas mobilis glucose facilitated diffusion protein, yeast high andlow affinity glucose transport proteins (SNF3 and HXT1 through HXT14),yeast galactose transporter, yeast maltose permease, yeast myo-inositoltransporter, yeast carboxylic acid transporter homolog JEN1, yeasthypothetical proteins YBR241c, YCR98c, and YFL040w, Klyveromyces lactislactose permease, Neurospora crassa quinate transporter, Emericellanidulans quinate permease, Chlorella hexose carrier, Arabidopsisthaliana glucose transporter, spinach sucrose transporter, Leishmaniadonovani transporters D1 and D2, Leishmania enriettii probable transportprotein LTP, Caenorhabditis elegans hypothetical protein ZK637.1,Haemophilus influenzae hypothetical proteins HI0281 and HI0418, andBacillus subtilis hypothetical proteins YxbC and YxdF. Occurrence of asugar (or other) transport domain in protein 593 indicates that protein593 is involved in transmembrane transport of one or more compounds,most likely a compound having a molecular weight on the order of ahexose or greater (i.e. having a molecular weight greater than about180). Examples of such compounds include prostaglandins, thromboxanes,hexoses, disaccharides, hormones (e.g. insulin), peptides,neurotransmitters, cytokines, chemokines, and the like. Protein 593 thusmediates one or more of facilitated diffusion and symport or antiport(e.g. involving co-transport of a proton, a sodium ion, a potassium ion,or another physiological ion). One, both, or neither of aglycosaminoglycan attached at the predicted glycosaminoglycan attachmentsite and a pyridoxal phosphate moiety attached at the predictedpyridoxal phosphate attachment site can, in conjunction with the aminoacid sequence of protein 593, determine the specificity of the proteinfor transporting molecules across the membrane of a cell in which it isexpressed.

[0708] Like human protein 38555, as described above, human protein 593comprises a Kazal domain. Occurrence of a Kazal domain in protein 593indicates that this protein is involved in formation and maintenance ofcell-to-cell interactions, and more particularly that the protein isinvolved in forming and maintaining neural synapses, including bothneuron-to-neuron synapses and neuron-to-non-neural cell synapses (e.g.neuromotor and neuroendocrine synapses).

[0709] Human protein 593 exhibits sequence similarity to HPT (GenBankAccession no. Q92959). Protein 593 is a twelve-transmembrane regionintegral membrane protein having transmembrane regions at approximatelythe following positions within SEQ ID NO:112: from about amino acidresidue 1 to about residue 10; from about amino acid residue 33 to aboutresidue 53; from about amino acid residue 62 to about residue 79; fromabout amino acid residue 118 to about residue 142; from about amino acidresidue 153 to about residue 177; from about amino acid residue 200 toabout residue 221; from about amino acid residue 262 to about residue283; from about amino acid residue 314 to about residue 334; from aboutamino acid residue 347 to about residue 364; from about amino acidresidue 469 to about residue 493; from about amino acid residue 509 toabout residue 528; and from about amino acid residue 556 to aboutresidue 579.

[0710] Extracellular domains are predicted to include approximatelyamino acid residues 11 to 32, 80 to 117, 178 to 199, 284 to 313, 365 to468, and 529 to 555 of SEQ ID NO:112. Intracellular domains arepredicted to include approximately amino acid residues 54 to 61, 143 to152, 222 to 261, 335 to 346, 494 to 508, and 580 to 612 of SEQ IDNO:112.

[0711] Human protein 593 can have additional amino acid residues at theamino terminal end of the sequence listed in SEQ ID NO:112 (i.e. theprotein can have an additional portion at its amino terminus). Forexample, protein 593 can have 1, 2, 4, 6, 10, 15, 20, 25, or 30 or moreadditional amino acid residues at the amino terminus indicated in SEQ IDNO:112.

[0712] As described elsewhere herein, relatively hydrophilic regions aregenerally located at or near the surface of a protein, and are morefrequently effective immunogenic epitopes than are relativelyhydrophobic regions. For example, the region of human protein 593 fromabout amino acid residue 240 to about amino acid residue 260 appears tobe located at or near the surface of the protein, while the region fromabout amino acid residue 415 to about amino acid residue 430 appears notto be located at or near the surface.

[0713] The predicted molecular weight of human protein 593 is about 65.4kilodaltons.

[0714] Biological Function of Human 593 Proteins, Nucleic Acids EncodingThem, and Modulators of These Molecules

[0715] Human 593 proteins are involved in disorders which affect bothtissues in which they are normally expressed and tissues in which theyare normally not expressed. Based on the observation that 593 proteinexhibits amino acid sequence homology to human protein 38555, which isexpressed in monkey brain and is therefore likely expressed in humanbrain tissue, human 593 protein is involved in one or more biologicalprocesses which occur in brain and other neurological tissues, althoughit can also be expressed in other tissues, and involved in disorders inthose tissues as well. In particular, 593 is involved in modulatinggrowth, proliferation, survival, differentiation, and activity of cellsincluding, but not limited to, central nervous system neurons,peripheral nervous system neurons, motor neurons, sensory neurons, andsympathetic and parasympathetic neural cells of the animal in which itis normally expressed. Protein 593 is also involved in mediatinginteractions between cells, particularly between two neurons, or betweena neuron and a non-neuronal cell such as a muscle or endocrine cell.Thus, 593 protein has a role in disorders which affect neuronal cellsand cells which interact with neurons and their growth, proliferation,survival, differentiation, and activity.

[0716] Widespread expression of 593 has been detected among human tissuetypes. Thus, the growth-, proliferation-, survival-, differentiation-,and activity-modulating activities of 593 protein affect cells of manytypes. Thus, protein 593 can affect cell-to-cell interactions in a widevariety of cell types.

[0717] Protein 593 can also be expressed in other tissues which normallyproduce or are acted upon by prostaglandins and thromboxanes. Suchtissues include, by way of example, blood tissues (e.g. bloodplatelets), epithelial tissues such as stomach, kidney, lung, uterus,vascular, and other epithelia, liver, ova, and spermatozoa. Protein 593is thus involved in one or more disorders which affect these tissues,such as one or more of the tissues listed above in the discussionregarding protein 38555.

[0718] The presence of the sugar (or other) transport domain in protein593 indicates that this protein is involved in transmembrane transportof one or more molecules such as neurotransmitters, prostaglandins,thromboxanes, hormones, small peptides, short polysaccharides (e.g.disaccharides), other charged organic compounds, and the like. Theproteins of the invention are therefore involved in one or moredisorders relating to inappropriate uptake or release of such molecules(i.e. including inappropriate failure to take up or release suchmolecules). Protein 593 is thus involved in one or more of a variety ofcellular uptake and release disorders such as diabetes, nutritionaldisorders (e.g. vitamin deficiencies, and malnutrition), metabolicdisorders (e.g. obesity, porphyrias, hyper- and hypolipoproteinemia,lipidoses, and water, electrolyte, mineral, and acid/base imbalances),and neural transmission disorders (e.g. inappropriate pain, dementia,multiple sclerosis, nerve root disorders, Alzheimer's disease,Parkinson's disease, depression, physical and psychological substanceaddiction, sexual dysfunction, schizophrenic disorders, delusionaldisorders, mood disorders, sleep disorders, and the like).

[0719] Occurrence of a Kazal domain in human protein 593 furtherimplicates this protein in neuronal development and neuronaltransmission processes. The presence of this domain therefore indicatesthat 593 protein is involved in disorders relating to inappropriateformation (i.e. including failure to form) and maintenance (i.e.including deterioration) of neuronal synapses, including bothneuron-to-neuron synapses and neuron-to-non-neuronal cell synapses.Thus, in addition to the neural transmission disorders described above,protein 593 is also implicated in disorders such as stroke, regenerationof chronically or traumatically damaged neuronal structures (includingnerve, brain, and spinal cord), developmental neuronal disorders (e.g.spina bifida), neuronal cancers (e.g. gliomas, astrocytomas,ependymomas, pituitary adenomas, and the like), peripheral nervedeficit, coronary insufficiency, angina, and the like.

[0720] The observation that human protein 593 shares sequence homologywith proteins involved in transmembrane prostaglandin transportindicates that 593 protein has activity identical or analogous to theactivity of those proteins, i.e. that 593 catalyzes or facilitatestransmembrane transport of one or more prostaglandins, thromboxanes,other hormones or hormone-like molecules, or other charged organiccompounds. Exemplary molecules which can be transported across cellmembranes via protein 593 include charged organic compounds, such as oneor more of prostaglandins A₁, A₂, B₁, B₂, D₂, E₁, E₂, F_(1α), F_(2α),G₂, H₂, I₂, and J₂ and thromboxanes A₂ and B₂. Uptake and release ofprostaglandins and thromboxanes, for example, are known to be involvedin a variety of physiological processes and disorders includingglaucoma, ovum fertilization, sperm motility, pregnancy, labor,delivery, abortion, gastric protection, peptic ulcer formation,intestinal fluid secretion, liver protection, liver damage, liverfibrosis, pain stimulation, glomerular filtration, maintenance of bodytemperature, fever, airway resistance, asthma, chronic obstructivepulmonary disorder, modulation of blood pressure, hypertension, shock,modulation of inflammation, platelet aggregation, abnormal bloodcoagulation, atherosclerosis, arteriosclerosis, and coronary arterydisease. Thus, polypeptides and nucleic acid molecules of the invention,and compounds which bind with or modulate one or more polypeptides andnucleic acid molecules of the invention can be used to prognosticate,diagnose, inhibit, or treat one or more of the disorders listed above orone or more disorders associated with the physiological processes listedabove.

[0721] Protein KIAA0880

[0722] A cDNA encoding at least a portion of human KIAA0880 protein wasisolated by others from a human brain library of cDNA clones on thebasis of the encoded protein being ‘large’ (Nagase et al. (1998) DNARes. 5:355-364; GenBank submission assigned Accession no. AB020687,submitted Dec. 2, 1998). At the time this cDNA was isolated andsubmitted to GenBank, it was unknown by the isolators whether theencoded protein had any physiological relevance and, if it did, whatthat relevance might be. The present inventor has discovered that theprotein encoded by the cDNA clone identified by Nagase et al. encodes atransmembrane transport protein that catalyzes transmembrane transportof charged organic compounds such as one or more prostaglandins. In viewof this discovery, it is now possible to make use of protein KIAA0880for the treatment of numerous disorders relating to aberranttransmembrane transport of prostaglandins and/or thromboxanes, and forother purposes.

[0723] The full length of the cDNA encoding human protein KIAA0880 (SEQID NO:114) is 4068 nucleotide residues and encodes a 709-amino acidprotein (SEQ ID NO:115) which exhibits amino acid sequence homology withHPT and other prostaglandin transporters.

[0724] KIAA0880 proteins of the invention and nucleic acid moleculesencoding them comprise a family of molecules having certain conservedstructural and functional features, as indicated by its close homologyto HPT (SEQ ID NO:116), the human OatP sodium-independent organic aniontransporter protein (GenBank Accession no. P46721; SEQ ID NO:117), human38555 protein (as described herein, SEQ ID NO:108), and human protein593 (as described herein, SEQ ID NO:112).

[0725] KIAA0880 proteins typically comprise a variety of potentialpost-translational modification sites (often within an extracellulardomain), such as those described herein in Table 53, as predicted bycomputerized sequence analysis of human KIAA0880 protein using aminoacid sequence comparison software (comparing the amino acid sequence ofprotein KIAA0880 with the information in the PROSITE database {rel.12.2; Feb, 1995} and the Hidden Markov Models database {Rel. PFAM 3.3}).In certain embodiments, a protein of the invention has at least 1, 2, 4,6, 8, or 10 or more of the post-translational modification sites listedin Table 53. TABLE 53 Type of Potential Modification Site Amino AcidResidues of Amino Acid or Domain SEQ ID NO:115 Sequence N-glycosylationsite 176 to 179 NCSS 350 to 353 NLTV 538 to 541 NCSC Protein kinase Cphosphorylation site 266 to 268 TIK 337 to 339 STK 367 to 369 TLR 507 to509 STR Casein kinase II phosphorylation site 74 to 77 STVE 92 to 95SFNE 147 to 150 TSPE 179 to 182 SYTE 212 to 215 SYIID 266 to 269 TIKID333 to 336 SPGE 488 to 491 SCME 508 to 511 TRVE 620 to 623 SAIDN-myristoylation site 88 to 93 GLLASF 129 to 134 GLLMTL 175 to 180GNCSSY 228 to 233 GILFAV 239 to 244 GLAFGL 262 to 267 GISLTI 424 to 429GIVVGG 449 to 454 GMLLCL 551 to 556 GSCDST 571 to 576 GSALAC 661 to 666GSVICF Amidation site 633 to 636 CGRR 700 to 703 PGKK MicrobodiesC-terminal targeting 707 to 709 SRV signal

[0726] Protein KIAA0880 is predicted by computerized amino acid sequenceanalysis (using the MEMSAT computer program) to be atwelve-transmembrane region integral membrane protein havingtransmembrane regions at approximately the following positions withinSEQ ID NO:115: from about amino acid residue 50 to about residue 69;from about amino acid residue 88 to about residue 108; from about aminoacid residue 117 to about residue 134; from about amino acid residue 186to about residue 206; from about amino acid residue 225 to about residue249; from about amino acid residue 276 to about residue 297; from aboutamino acid residue 372 to about residue 394; from about amino acidresidue 411 to about residue 432; from about amino acid residue 440 toabout residue 463; from about amino acid residue 564 to about residue587; from about amino acid residue 596 to about residue 612; and fromabout amino acid residue 651 to about residue 673

[0727] Extracellular domains are predicted to include approximatelyamino acid residues 70 to 87, 135 to 185, 250 to 275, 395 to 410, 464 to563, and 613 to 650 of SEQ ID NO:115. Intracellular domains arepredicted to include approximately amino acid residues 1 to 49, 109 to116, 207 to 224, 298 to 371, 433 to 439, 588 to 595, and 674 to 709 ofSEQ ID NO:115.

[0728] As described elsewhere herein, relatively hydrophilic regions aregenerally located at or near the surface of a protein, and are morefrequently effective immunogenic epitopes than are relativelyhydrophobic regions. For example, the region of human protein KIAA0880from about amino acid residue 135 to about amino acid residue 155appears to be located at or near the surface of the protein, while theregion from about amino acid residue 160 to about amino acid residue 165appears not to be located at or near the surface.

[0729] Human protein KIAA0880 exhibits sequence similarity to HPT(GenBank Accession no. Q92959; SEQ ID NO:117). An alignment betweenKIAA0880 (SEQ ID NO:115 and HPT (SEQ ID NO:117), made using the ALIGNprogram of the GCG software package, pam120.mat scoring matrix, gappenalties −12/−4, reveals that the amino acid sequences of the proteinsare 39.5% identical.

[0730] The predicted molecular weight of human protein KIAA0880 is about76.7 kilodaltons.

[0731] Biological Function of Human KIAA0880 Proteins, Nucleic AcidsEncoding Them, and Modulators of These Molecules

[0732] Human KIAA0880 protein is involved in disorders which affect bothtissues in which they are normally expressed and tissues in which theyare normally not expressed. Based on the observation by others thatKIAA0880 protein is expressed in human brain tissue and on the functionof this protein as identified herein, human KIAA0880 protein is involvedin one or more biological processes which occur in brain and otherneurological tissues. In particular, KIAA0880 is involved in modulatinggrowth, proliferation, survival, differentiation, and activity of cellsincluding, but not limited to, central nervous system neurons,peripheral nervous system neurons, motor neurons, sensory neurons, andsympathetic and parasympathetic neural cells of the animal in which itis normally expressed. Protein KIAA0880 is also involved in mediatinginteractions between cells, particularly between two neurons, or betweena neuron and a non-neuronal cell such as a muscle or endocrine cell.Thus, KIAA0880 protein has a role in disorders which affect neuronalcells and cells which interact with neurons and their growth,proliferation, survival, differentiation, and activity.

[0733] Widespread expression of KIAA0880 has been detected among humantissue types. Thus, the growth-, proliferation-, survival-,differentiation-, and activity-modulating activities of KIAA0880 proteinaffect cells of many types. Thus, protein KIAA0880 can affectcell-to-cell interactions in a wide variety of cell types.

[0734] Protein KIAA0880 is involved in transmembrane transport of one ormore charged organic compounds such as prostaglandins, thromboxanes, andthe like. Protein KIAA0880 mediates one or more of facilitated diffusionof the prostaglandin (or thromboxane or the like) and symport orantiport (e.g. involving co-transport of a proton, a sodium ion, apotassium ion, or another physiological ion).

[0735] Protein KIAA0880 is therefore involved in transmembrane transportof charged organic molecules such as one or more prostaglandins andthromboxanes in brain and other neural tissues in humans, and is thusinvolved in, and can be used to prognosticate, prevent, diagnose, ortreat, one or more disorders related to inappropriate transmembranetransport (i.e. including inappropriate failure of transport) ofprostaglandins, thromboxanes, and the like in neural tissues. Suchdisorders include, by way of example, neural transmission disorders(e.g. inappropriate pain, dementia, multiple sclerosis, nerve rootdisorders, Alzheimer's disease, Parkinson's disease, depression,physical and psychological substance addiction, sexual dysfunction,schizophrenic disorders, delusional disorders, mood disorders, sleepdisorders, and the like) and disorders relating to inappropriateformation (i.e. including failure to form) and maintenance (i.e.including deterioration) of neuronal synapses, including bothneuron-to-neuron synapses and neuron-to-non-neuronal cell synapses.Thus, in addition to the neural transmission disorders described above,protein KIAA0880 is also implicated in, and can be used toprognosticate, prevent, diagnose, or treat, one or more disorders suchas stroke, regeneration of chronically or traumatically damaged neuronalstructures (including nerve, brain, and spinal cord), developmentalneuronal disorders (e.g. spina bifida), neuronal cancers (e.g. gliomas,astrocytomas, ependymomas, pituitary adenomas, and the like), peripheralnerve deficit, coronary insufficiency, angina, and the like. Exemplarymolecules which can be transported across cell membranes via proteinKIAA0880 include one or more charged organic compounds such asprostaglandins A₁, A₂, B₁, B₂, D₂, E₁, E₂, F_(1α), F_(2α), G₂, H₂, I₂,and J₂ and thromboxanes A₂ and B₂. Uptake and release of prostaglandinsand thromboxanes, for example, are known to be involved in a variety ofphysiological processes and disorders including glaucoma, ovumfertilization, sperm motility, pregnancy, labor, delivery, abortion,gastric protection, peptic ulcer formation, intestinal fluid secretion,liver protection, liver damage, liver fibrosis, pain stimulation,glomerular filtration, maintenance of body temperature, fever, airwayresistance, asthma, chronic obstructive pulmonary disorder, modulationof blood pressure, hypertension, shock, modulation of inflammation,platelet aggregation, abnormal blood coagulation, atherosclerosis,arteriosclerosis, and coronary artery disease. Thus, polypeptides andnucleic acid molecules of the invention, and compounds which bind withor modulate one or more polypeptides and nucleic acid molecules of theinvention can be used to prognosticate, diagnose, inhibit, or treat oneor more of the disorders listed above or one or more disordersassociated with the physiological processes listed above.

[0736] Biological Deposit

[0737] Clones encoding human 38555 and 593 proteins were deposited withATCC on Jul. 22, 1999 in the form of a mixture of two plasmids, one(Ep65h2) encoding protein 38555, the other (Ep593) encoding protein 593.This deposit will be maintained under the terms of the Budapest Treatyon the International Recognition of the Deposit of Microorganisms forthe Purposes of Patent Procedure.

[0738] In order to check for the presence of Ep65h2 and Ep593 in thedeposited mixture, an E. coli host strain (e.g. DH5a) is transformedusing the mixture and plated and incubated on Luria broth platescontaining 100 micrograms per milliliter ampicillin. About 10 to 20transformants are selected and subjected to a standard plasmidminipreparation procedure. Each DNA is digested using restrictionendonuclease EcoRI and the fragments are separated by, for example,agarose gel electrophoresis. Fragments are visualized (e.g. usingethidium bromide in the agarose gel). EcoRI digestion of Ep62h5 yieldsone band approximately 5.5 kB in size. EcoRI digestion of Ep62h5 yieldstwo bands, one having a size of about 3.5 kB, and the other having asize of about 1.5 kB.

[0739] This deposit was made merely as a convenience to those of skillin the art. This deposit is not an admission that a deposit is requiredpursuant to 35 U.S.C. §112.

[0740] Definitions

[0741] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein, fragments thereof, and derivatives and other variants ofthe sequence in SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55,58, 64, 67, 72, 89, 105, 108 or 112 thereof are collectively referred toas “polypeptides or proteins of the invention” or “21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptides orproteins”. Nucleic acid molecules encoding such polypeptides or proteinsare collectively referred to as “nucleic acids of the invention” or“21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593nucleic acids.”

[0742] As used herein, the term “nucleic acid molecule” includes DNAmolecules (e.g., a cDNA or genomic DNA) and RNA molecules (e.g., anmRNA) and analogs of the DNA or RNA generated, e.g., by the use ofnucleotide analogs. The nucleic acid molecule can be single-stranded ordouble-stranded, but preferably is double-stranded DNA.

[0743] The term “isolated or purified nucleic acid molecule” includesnucleic acid molecules which are separated from other nucleic acidmolecules which are present in the natural source of the nucleic acid.For example, with regards to genomic DNA, the term “isolated” includesnucleic acid molecules which are separated from the chromosome withwhich the genomic DNA is naturally associated. Preferably, an “isolated”nucleic acid is free of sequences which naturally flank the nucleic acid(i.e., sequences located at the 5′ and/or 3′ ends of the nucleic acid)in the genomic DNA of the organism from which the nucleic acid isderived. For example, in various embodiments, the isolated nucleic acidmolecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5kb or 0.1 kb of 5′ and/or 3′ nucleotide sequences which naturally flankthe nucleic acid molecule in genomic DNA of the cell from which thenucleic acid is derived. Moreover, an “isolated” nucleic acid molecule,such as a cDNA molecule, can be substantially free of other cellularmaterial or culture medium when produced by recombinant techniques, orsubstantially free of chemical precursors or other chemicals whenchemically synthesized.

[0744] As used herein, the term “hybridizes under low stringency, mediumstringency, high stringency, or very high stringency conditions”describes conditions for hybridization and washing. Guidance forperforming hybridization reactions can be found in Current Protocols inMolecular Biology (1989) John Wiley & Sons, N.Y., 6.3.1-6.3.6, which isincorporated by reference. Aqueous and nonaqueous methods are describedin that reference and either can be used. Specific hybridizationconditions referred to herein are as follows: 1) low stringencyhybridization conditions in 6× sodium chloride/sodium citrate (SSC) atabout 45° C., followed by two washes in 0.2×SSC, 0.1% SDS at least at50° C. (the temperature of the washes can be increased to 55° C. for lowstringency conditions); 2) medium stringency hybridization conditions in6×SSC at about 45° C., followed by one or more washes in 0.2×SSC, 0.1%SDS at 60° C.; 3) high stringency hybridization conditions in 6×SSC atabout 45° C., followed by one or more washes in 0.2×SSC, 0.1% SDS at 65°C.; and preferably 4) very high stringency hybridization conditions are0.5M sodium phosphate, 7% SDS at 65° C., followed by one or more washesat 0.2×SSC, 1% SDS at 65° C. Very high stringency conditions (4) are thepreferred conditions and the ones that should be used unless otherwisespecified.

[0745] As used herein, a “naturally-occurring” nucleic acid moleculerefers to an RNA or DNA molecule having a nucleotide sequence thatoccurs in nature (e.g., encodes a natural protein).

[0746] As used herein, the terms “gene” and “recombinant gene” refer tonucleic acid molecules which include an open reading frame encoding a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein, preferably a mammalian 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein, and can further include non-codingregulatory sequences, and introns.

[0747] An “isolated” or “purified” polypeptide or protein issubstantially free of cellular material or other contaminating proteinsfrom the cell or tissue source from which the protein is derived, orsubstantially free from chemical precursors or other chemicals whenchemically synthesized. In one embodiment, the language “substantiallyfree” means preparation of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein having less than about 30%, 20%, 10%and more preferably 5% (by dry weight), of non-21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein (also referredto herein as a “contaminating protein”), or of chemical precursors ornon-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593chemicals. When the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein or biologically active portion thereof isrecombinantly produced, it is also preferably substantially free ofculture medium, i.e., culture medium represents less than about 20%,more preferably less than about 10%, and most preferably less than about5% of the volume of the protein preparation. The invention includesisolated or purified preparations of at least 0.01, 0.1, 1.0, and 10milligrams in dry weight.

[0748] A “non-essential” amino acid residue is a residue that can bealtered from the wild-type sequence of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 (e.g., the sequence of SEQ ID NO:1, 3,5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48,49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107,109, 111 or 113) without abolishing or more preferably, withoutsubstantially altering a biological activity, whereas an “essential”amino acid residue results in such a change. For example, amino acidresidues that are conserved among the polypeptides of the presentinvention, e.g., those present in the conserved domains, are predictedto be particularly unamenable to alteration.

[0749] A “conservative amino acid substitution” is one in which theamino acid residue is replaced with an amino acid residue having asimilar side chain. Families of amino acid residues having similar sidechains have been defined in the art. These families include amino acidswith basic side chains (e.g., lysine, arginine, histidine), acidic sidechains (e.g., aspartic acid, glutamic acid), uncharged polar side chains(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine,cysteine), nonpolar side chains (e.g., alanine, valine, leucine,isoleucine, proline, phenylalanine, methionine, tryptophan),beta-branched side chains (e.g., threonine, valine, isoleucine) andaromatic side chains (e.g., tyrosine, phenylalanine, tryptophan,histidine). Thus, a predicted nonessential amino acid residue in a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein is preferably replaced with another amino acid residue from thesame side chain family. Alternatively, in another embodiment, mutationscan be introduced randomly along all or part of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 coding sequence, such asby saturation mutagenesis, and the resultant mutants can be screened for21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593biological activity to identify mutants that retain activity. Followingmutagenesis of SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31,33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71,73, 88, 90, 104, 106, 107, 109, 111 or 113, the encoded protein can beexpressed recombinantly and the activity of the protein can bedetermined.

[0750] As used herein, a “biologically active portion” of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteinincludes a fragment of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein which participates in an interaction betweena 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593molecule and a non-21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 molecule. Biologically active portions of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteininclude peptides comprising amino acid sequences sufficiently homologousto or derived from the amino acid sequence of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein, e.g., the aminoacid sequence shown in SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47,50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, which include fewer aminoacids than the full length 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein, and exhibit at least one activity ofa 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein. Typically, biologically active portions comprise a domain ormotif with at least one activity of the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein. A biologically activeportion of a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein can be a polypeptide which is, for example, 10, 25,50, 100, 200 or more amino acids in length. Biologically active portionsof a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein can be used as targets for developing agents which modulate a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593mediated activity.

[0751] Calculations of homology or sequence identity (the terms“homology” and “identity” are used interchangeably herein) betweensequences are performed as follows:

[0752] To determine the percent identity of two amino acid sequences, orof two nucleic acid sequences, the sequences are aligned for optimalcomparison purposes (e.g., gaps can be introduced in one or both of afirst and a second amino acid or nucleic acid sequence for optimalalignment and non-homologous sequences can be disregarded for comparisonpurposes). In a preferred embodiment, the length of a reference sequencealigned for comparison purposes is at least 30%, preferably at least40%, more preferably at least 50%, even more preferably at least 60%,and even more preferably at least 70%, 80%, 90%, 100% of the length ofthe reference sequence. The amino acid residues or nucleotides atcorresponding amino acid positions or nucleotide positions are thencompared. When a position in the first sequence is occupied by the sameamino acid residue or nucleotide as the corresponding position in thesecond sequence, then the molecules are identical at that position (asused herein amino acid or nucleic acid “identity” is equivalent to aminoacid or nucleic acid “homology”). The percent identity between the twosequences is a function of the number of identical positions shared bythe sequences, taking into account the number of gaps, and the length ofeach gap, which need to be introduced for optimal alignment of the twosequences.

[0753] The comparison of sequences and determination of percent identitybetween two sequences can be accomplished using a mathematicalalgorithm. In a preferred embodiment, the percent identity between twoamino acid sequences is determined using the Needleman and Wunsch (1970)J. Mol. Biol. 48:444-453 algorithm which has been incorporated into theGAP program in the GCG software package using either a Blossum 62 matrixor a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and alength weight of 1, 2, 3, 4, 5, or 6. In yet another preferredembodiment, the percent identity between two nucleotide sequences isdetermined using the GAP program in the GCG software package using aNWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and alength weight of 1, 2, 3, 4, 5, or 6. A particularly preferred set ofparameters (and the one that should be used if the practitioner isuncertain about what parameters should be applied to determine if amolecule is within a sequence identity or homology limitation of theinvention) are a Blossum 62 scoring matrix with a gap penalty of 12, agap extend penalty of 4, and a frameshift gap penalty of 5.

[0754] The percent identity between two amino acid or nucleotidesequences can be determined using the algorithm of Meyers and Miller((1989) CABIOS, 4:11-17) which has been incorporated into the ALIGNprogram (version 2.0), using a PAM 120 weight residue table, a gaplength penalty of 12 and a gap penalty of 4.

[0755] The nucleic acid and protein sequences described herein can beused as a “query sequence” to perform a search against public databasesto, for example, identify other family-members or related sequences.Such searches can be performed using the NBLAST and XBLAST programs(version 2.0) of Altschul et al. (1990) J. Mol. Biol. 215:403-10. BLASTnucleotide searches can be performed with the NBLAST program, score=100,wordlength=12 to obtain nucleotide sequences homologous to 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acidmolecules of the invention. BLAST protein searches can be performed withthe XBLAST program, score=50, wordlength=3 to obtain amino acidsequences homologous to 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein molecules of the invention. To obtain gappedalignments for comparison purposes, Gapped BLAST can be utilized asdescribed in Altschul et al., (1997) Nucleic Acids Res. 25:3389-3402.When utilizing BLAST and Gapped BLAST programs, the default parametersof the respective programs (e.g., XBLAST and NBLAST) can be used.

[0756] Particular 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 polypeptides of the present invention have an amino acidsequence substantially identical to the amino acid sequence of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112. In the context of an amino acid sequence, the term“substantially identical” is used herein to refer to a first amino acidthat contains a sufficient or minimum number of amino acid residues thatare i) identical to, or ii) conservative substitutions of aligned aminoacid residues in a second amino acid sequence such that the first andsecond amino acid sequences can have a common structural domain and/orcommon functional activity. For example, amino acid sequences thatcontain a common structural domain having at least about 60%, or 65%identity, likely 75% identity, more likely 85%, 90%. 91%, 92%, 93%, 94%,95%, 96%, 97%, 98% or 99% identity to SEQ ID NO:2, 6, 11, 19, 22, 25,32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 are termedsubstantially identical.

[0757] In the context of nucleotide sequence, the term “substantiallyidentical” is used herein to refer to a first nucleic acid sequence thatcontains a sufficient or minimum number of nucleotides that areidentical to aligned nucleotides in a second nucleic acid sequence suchthat the first and second nucleotide sequences encode a polypeptidehaving common functional activity, or encode a common structuralpolypeptide domain or a common functional polypeptide activity. Forexample, nucleotide sequences having at least about 60%, or 65%identity, likely 75% identity, more likely 85%, 90%. 91%, 92%, 93%, 94%,95%, 96%, 97%, 98% or 99% identity to SEQ ID NO:1, 3, 5, 7, 10, 12, 18,20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57,59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113 aretermed substantially identical.

[0758] “Misexpression or aberrant expression”, as used herein, refers toa non-wild type pattern of gene expression, at the RNA or protein level.It includes: expression at non-wild type levels, i.e., over or underexpression; a pattern of expression that differs from wild type in termsof the time or stage at which the gene is expressed, e.g., increased ordecreased expression (as compared with wild type) at a predetermineddevelopmental period or stage; a pattern of expression that differs fromwild type in terms of decreased expression (as compared with wild type)in a predetermined cell type or tissue type; a pattern of expressionthat differs from wild type in terms of the splicing size, amino acidsequence, post-transitional modification, or biological activity of theexpressed polypeptide; a pattern of expression that differs from wildtype in terms of the effect of an environmental stimulus orextracellular stimulus on expression of the gene, e.g., a pattern ofincreased or decreased expression (as compared with wild type) in thepresence of an increase or decrease in the strength of the stimulus.

[0759] “Subject”, as used herein, can refer to a mammal, e.g., a human,or to an experimental or animal or disease model. The subject can alsobe a non-human animal, e.g., a horse, cow, goat, or other domesticanimal.

[0760] A “purified preparation of cells”, as used herein, refers to, inthe case of plant or animal cells, an in vitro preparation of cells andnot an entire intact plant or animal. In the case of cultured cells ormicrobial cells, it consists of a preparation of at least 10% and morepreferably 50% of the subject cells.

[0761] As used herein, cellular proliferative and/or differentiativedisorders include cancer, e.g., carcinoma, sarcoma, metastatic disordersor hematopoietic neoplastic disorders, e.g., leukemias. A metastatictumor can arise from a multitude of primary tumor types, including butnot limited to those of prostate, colon, lung, breast and liver origin.

[0762] As used herein, the term “cancer” (also used interchangeably withthe terms, “hyperproliferative” and “neoplastic”) refers to cells havingthe capacity for autonomous growth, i.e., an abnormal state or conditioncharacterized by rapidly proliferating cell growth. Cancerous diseasestates may be categorized as pathologic, i.e., characterizing orconstituting a disease state, e.g., malignant tumor growth, or may becategorized as non-pathologic, i.e., a deviation from normal but notassociated with a disease state, e.g., cell proliferation associatedwith wound repair. The term is meant to include all types of cancerousgrowths or oncogenic processes, metastatic tissues or malignantlytransformed cells, tissues, or organs, irrespective of histopathologictype or stage of invasiveness. The term “cancer” includes malignanciesof the various organ systems, such as those affecting lung, breast,cervix, ovary, thyroid, lymphoid, gastrointestinal, and genito-urinarytract, as well as adenocarcinomas which include malignancies such asmost colon cancers, renal-cell carcinoma, prostate cancer and/ortesticular tumors, non-small cell carcinoma of the lung, cancer of thesmall intestine and cancer of the esophagus. The term “carcinoma” is artrecognized and refers to malignancies of epithelial or endocrine tissuesincluding respiratory system carcinomas, gastrointestinal systemcarcinomas, genitourinary system carcinomas, testicular carcinomas,breast carcinomas, prostatic carcinomas, endocrine system carcinomas,and melanomas. Exemplary carcinomas include those forming from tissue ofthe cervix, lung, prostate, breast, head and neck, colon and ovary. Theterm “carcinoma” also includes carcinosarcomas, e.g., which includemalignant tumors composed of carcinomatous and sarcomatous tissues. An“adenocarcinoma” refers to a carcinoma derived from glandular tissue orin which the tumor cells form recognizable glandular structures. Theterm “sarcoma” is art recognized and refers to malignant tumors ofmesenchymal derivation.

[0763] Examples of cellular proliferative and/or differentiativedisorders of the lung include, but are not limited to, tumors such asbronchogenic carcinoma, including paraneoplastic syndromes,bronchioloalveolar carcinoma, neuroendocrine tumors, such as bronchialcarcinoid, miscellaneous tumors, metastatic tumors, and pleural tumors,including solitary fibrous tumors (pleural fibroma) and malignantmesothelioma.

[0764] Examples of cellular proliferative and/or differentiativedisorders of the breast include, but are not limited to, proliferativebreast disease including, e.g., epithelial hyperplasia, sclerosingadenosis, and small duct papillomas; tumors, e.g., stromal tumors suchas fibroadenoma, phyllodes tumor, and sarcomas, and epithelial tumorssuch as large duct papilloma; carcinoma of the breast including in situ(noninvasive) carcinoma that includes ductal carcinoma in situ(including Paget's disease) and lobular carcinoma in situ, and invasive(infiltrating) carcinoma including, but not limited to, invasive ductalcarcinoma, invasive lobular carcinoma, medullary carcinoma, colloid(mucinous) carcinoma, tubular carcinoma, and invasive papillarycarcinoma, and miscellaneous malignant neoplasms. Disorders in the malebreast include, but are not limited to, gynecomastia and carcinoma.

[0765] Examples of cellular proliferative and/or differentiativedisorders involving the colon include, but are not limited to, tumors ofthe colon, such as non-neoplastic polyps, adenomas, familial syndromes,colorectal carcinogenesis, colorectal carcinoma, and carcinoid tumors.

[0766] Examples of cancers or neoplastic conditions, in addition to theones described above, include, but are not limited to, a fibrosarcoma,myosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma,angiosarcoma, endotheliosarcoma, lymphangiosarcoma,lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor,leiomyosarcoma, rhabdomyosarcoma, gastric cancer, esophageal cancer,rectal cancer, pancreatic cancer, ovarian cancer, prostate cancer,uterine cancer, cancer of the head and neck, skin cancer, brain cancer,squamous cell carcinoma, sebaceous gland carcinoma, papillary carcinoma,papillary adenocarcinoma, cystadenocarcinoma, medullary carcinoma,bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile ductcarcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilm's tumor,cervical cancer, testicular cancer, small cell lung carcinoma, non-smallcell lung carcinoma, bladder carcinoma, epithelial carcinoma, glioma,astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma,hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma,melanoma, neuroblastoma, retinoblastoma, leukemia, lymphoma, or Kaposisarcoma.

[0767] Proliferative disorders include hematopoietic neoplasticdisorders. As used herein, the term “hematopoietic neoplastic disorders”includes diseases involving hyperplastic/neoplastic cells ofhematopoietic origin, e.g., arising from myeloid, lymphoid or erythroidlineages, or precursor cells thereof. Preferably, the diseases arisefrom poorly differentiated acute leukemias, e.g., erythroblasticleukemia and acute megakaryoblastic leukemia. Additional exemplarymyeloid disorders include, but are not limited to, acute promyeloidleukemia (APML), acute myelogenous leukemia (AML) and chronicmyelogenous leukemia (CML) (reviewed in Vaickus (1991) Crit Rev. inOncol./Hemotol. 11:267-97); lymphoid malignancies include, but are notlimited to acute lymphoblastic leukemia (ALL) which includes B-lineageALL and T-lineage ALL, chronic lymphocytic leukemia (CLL),prolymphocytic leukemia (PLL), hairy cell leukemia (HLL) andWaldenstrom's macroglobulinemia (WM). Additional forms of malignantlymphomas include, but are not limited to non-Hodgkin lymphoma andvariants thereof, peripheral T cell lymphomas, adult T cellleukemia/lymphoma (ATL), cutaneous T-cell lymphoma (CTCL), largegranular lymphocytic leukemia (LGF), Hodgkin's disease andReed-Sternberg disease.

[0768] As used herein, disorders of the breast include, but are notlimited to, disorders of development; inflammations, including but notlimited to, acute mastitis, periductal mastitis, periductal mastitis(recurrent subareolar abscess, squamous metaplasia of lactiferousducts), mammary duct ectasia, fat necrosis, granulomatous mastitis, andpathologies associated with silicone breast implants; fibrocysticchanges; proliferative breast disease including, but not limited to,epithelial hyperplasia, sclerosing adenosis, and small duct papillomas;tumors including, but not limited to, stromal tumors such asfibroadenoma, phyllodes tumor, and sarcomas, and epithelial tumors suchas large duct papilloma; carcinoma of the breast including in situ(noninvasive) carcinoma that includes ductal carcinoma in situ(including Paget's disease) and lobular carcinoma in situ, and invasive(infiltrating) carcinoma including, but not limited to, invasive ductalcarcinoma, no special type, invasive lobular carcinoma, medullarycarcinoma, colloid (mucinous) carcinoma, tubular carcinoma, and invasivepapillary carcinoma, and miscellaneous malignant neoplasms. Disorders inthe male breast include, but are not limited to, gynecomastia andcarcinoma.

[0769] As used herein, disorders involving the colon include, but arenot limited to, congenital anomalies, such as atresia and stenosis,Meckel diverticulum, congenital aganglionic megacolon-Hirschsprungdisease; enterocolitis, such as diarrhea and dysentery, infectiousenterocolitis, including viral gastroenteritis, bacterial enterocolitis,necrotizing enterocolitis, antibiotic-associated colitis(pseudomembranous colitis), and collagenous and lymphocytic colitis,miscellaneous intestinal inflammatory disorders, including parasites andprotozoa, acquired immunodeficiency syndrome, transplantation,drug-induced intestinal injury, radiation enterocolitis, neutropeniccolitis (typhlitis), and diversion colitis; idiopathic inflammatorybowel disease, such as Crohn disease and ulcerative colitis; tumors ofthe colon, such as non-neoplastic polyps, adenomas, familial syndromes,colorectal carcinogenesis, colorectal carcinoma, and carcinoid tumors.

[0770] As used herein, disorders involving the kidney (or renaldisorders) include, but are not limited to, congenital anomaliesincluding, but not limited to, cystic diseases of the kidney, thatinclude but are not limited to, cystic renal dysplasia, autosomaldominant (adult) polycystic kidney disease, autosomal recessive(childhood) polycystic kidney disease, and cystic diseases of renalmedulla, which include, but are not limited to, medullary sponge kidney,and nephronophthisis-uremic medullary cystic disease complex, acquired(dialysis-associated) cystic disease, such as simple cysts; glomerulardiseases including pathologies of glomerular injury that include, butare not limited to, in situ immune complex deposition, that includes,but is not limited to, anti-GBM nephritis, Heymann nephritis, andantibodies against planted antigens, circulating immune complexnephritis, antibodies to glomerular cells, cell-mediated immunity inglomerulonephritis, activation of alternative complement pathway,epithelial cell injury, and pathologies involving mediators ofglomerular injury including cellular and soluble mediators, acuteglomerulonephritis, such as acute proliferative (poststreptococcal,postinfectious) glomerulonephritis, including but not limited to,poststreptococcal glomerulonephritis and nonstreptococcal acuteglomerulonephritis, rapidly progressive (crescentic) glomerulonephritis,nephrotic syndrome, membranous glomerulonephritis (membranousnephropathy), minimal change disease (lipoid nephrosis), focal segmentalglomerulosclerosis, membranoproliferative glomerulonephritis, IgAnephropathy (Berger disease), focal proliferative and necrotizingglomerulonephritis (focal glomerulonephritis), hereditary nephritis,including but not limited to, Alport syndrome and thin membrane disease(benign familial hematuria), chronic glomerulonephritis, glomerularlesions associated with systemic disease, including but not limited to,systemic lupus erythematosus, Henoch-Schönlein purpura, bacterialendocarditis, diabetic glomerulosclerosis, amyloidosis, fibrillary andimmunotactoid glomerulonephritis, and other systemic disorders; diseasesaffecting tubules and interstitium, including acute tubular necrosis andtubulointerstitial nephritis, including but not limited to,pyelonephritis and urinary tract infection, acute pyelonephritis,chronic pyelonephritis and reflux nephropathy, and tubulointerstitialnephritis induced by drugs and toxins, including but not limited to,acute drug-induced interstitial nephritis, analgesic abuse nephropathy,nephropathy associated with non steroidal anti-inflammatory drugs, andother tubulointerstitial diseases including, but not limited to, uratenephropathy, hypercalcemia and nephrocalcinosis, and multiple myeloma;diseases of blood vessels including benign nephrosclerosis, malignanthypertension and accelerated nephrosclerosis, renal artery stenosis, andthrombotic microangiopathies including, but not limited to, classic(childhood) hemolytic-uremic syndrome, adult hemolytic-uremicsyndrome/thrombotic thrombocytopenic purpura, idiopathic HUS/TTP, andother vascular disorders including, but not limited to, atheroscleroticischemic renal disease, atheroembolic renal disease, sickle cell diseasenephropathy, diffuse cortical necrosis, and renal infarcts; urinarytract obstruction (obstructive uropathy); urolithiasis (renal calculi,stones); and tumors of the kidney including, but not limited to, benigntumors, such as renal papillary adenoma, renal fibroma or hamartoma(renomedullary interstitial cell tumor), angiomyolipoma, and oncocytoma,and malignant tumors, including renal cell carcinoma (hypernephroma,adenocarcinoma of kidney), which includes urothelial carcinomas of renalpelvis.

[0771] Examples of disorders of the lung include, but are not limitedto, congenital anomalies; atelectasis; diseases of vascular origin, suchas pulmonary congestion and edema, including hemodynamic pulmonary edemaand edema caused by microvascular injury, adult respiratory distresssyndrome (diffuse alveolar damage), pulmonary embolism, hemorrhage, andinfarction, and pulmonary hypertension and vascular sclerosis; chronicobstructive pulmonary disease, such as emphysema, chronic bronchitis,bronchial asthma, and bronchiectasis; diffuse interstitial(infiltrative, restrictive) diseases, such as pneumoconioses,sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitialpneumonitis, hypersensitivity pneumonitis, pulmonary eosinophilia(pulmonary infiltration with eosinophilia), Bronchiolitisobliterans-organizing pneumonia, diffuse pulmonary hemorrhage syndromes,including Goodpasture syndrome, idiopathic pulmonary hemosiderosis andother hemorrhagic syndromes, pulmonary involvement in collagen vasculardisorders, and pulmonary alveolar proteinosis; complications oftherapies, such as drug-induced lung disease, radiation-induced lungdisease, and lung transplantation; tumors, such as bronchogeniccarcinoma, including paraneoplastic syndromes, bronchioloalveolarcarcinoma, neuroendocrine tumors, such as bronchial carcinoid,miscellaneous tumors, and metastatic tumors; pathologies of the pleura,including inflammatory pleural effusions, noninflammatory pleuraleffusions, pneumothorax, and pleural tumors, including solitary fibroustumors (pleural fibroma) and malignant mesothelioma.

[0772] As used herein. disorders involving the pancreas include those ofthe exocrine pancreas such as congenital anomalies, including but notlimited to, ectopic pancreas; pancreatitis, including but not limitedto, acute pancreatitis; cysts, including but not limited to,pseudocysts; tumors, including but not limited to, cystic tumors andcarcinoma of the pancreas; and disorders of the endocrine pancreas suchas, diabetes mellitus; islet cell tumors, including but not limited to,insulinomas, gastrinomas, and other rare islet cell tumors.

[0773] As used herein, disorders involving the ovary include, forexample, polycystic ovarian disease, Stein-leventhal syndrome,Pseudomyxoma peritonei and stromal hyperthecosis; ovarian tumors suchas, tumors of coelomic epithelium, serous tumors, mucinous tumors,endometeriod tumors, clear cell adenocarcinoma, cystadenofibroma,brenner tumor, surface epithelial tumors; germ cell tumors such asmature (benign) teratomas, monodermal teratomas, immature malignantteratomas, dysgerminoma, endodermal sinus tumor, choriocarcinoma; sexcord-stomal tumors such as, granulosa-theca cell tumors,thecoma-fibromas, androblastomas, hill cell tumors, and gonadoblastoma;and metastatic tumors such as Krukenberg tumors.

[0774] Aberrant expression and/or activity of the molecules of theinvention can mediate disorders associated with bone metabolism. “Bonemetabolism” refers to direct or indirect effects in the formation ordegeneration of bone structures, e.g., bone formation, bone resorption,etc., which can ultimately affect the concentrations in serum of calciumand phosphate. This term also includes activities mediated by themolecules of the invention in bone cells, e.g. osteoclasts andosteoblasts, that can in turn result in bone formation and degeneration.For example, molecules of the invention can support different activitiesof bone resorbing osteoclasts such as the stimulation of differentiationof monocytes and mononuclear phagocytes into osteoclasts. Accordingly,molecules of the invention that modulate the production of bone cellscan influence bone formation and degeneration, and thus can be used totreat bone disorders. Examples of such disorders include, but are notlimited to, osteoporosis, osteodystrophy, osteomalacia, rickets,osteitis fibrosa cystica, renal osteodystrophy, osteosclerosis,anti-convulsant treatment, osteopenia, fibrogenesis-imperfecta ossium,secondary hyperparathyrodism, hypoparathyroidism, hyperparathyroidism,cirrhosis, obstructive jaundice, drug induced metabolism, medullarycarcinoma, chronic renal disease, rickets, sarcoidosis, glucocorticoidantagonism, malabsorption syndrome, steatorrhea, tropical sprue,idiopathic hypercalcemia and milk fever.

[0775] As used herein, “a prostate disorder” refers to an abnormalcondition occurring in the male pelvic region characterized by, e.g.,male sexual dysfunction and/or urinary symptoms. This disorder may bemanifested in the form of genitourinary inflammation (e.g., inflammationof smooth muscle cells) as in several common diseases of the prostateincluding prostatitis, benign prostatic hyperplasia and cancer, e.g.,adenocarcinoma or carcinoma, of the prostate.

[0776] Examples of immune, e.g., inflammatory, (e.g. respiratoryinflammatory) disorders or diseases include, but are not limited to,autoimmune diseases (including, for example, diabetes mellitus,arthritis (including rheumatoid arthritis, juvenile rheumatoidarthritis, osteoarthritis, psoriatic arthritis), multiple sclerosis,encephalomyelitis, myasthenia gravis, systemic lupus erythematosis,autoimmune thyroiditis, dermatitis (including atopic dermatitis andeczematous dermatitis), psoriasis, Sjögren's Syndrome, inflammatorybowel disease, e.g. Crohn's disease and ulcerative colitis, aphthousulcer, iritis, conjunctivitis, keratoconjunctivitis, asthma, allergicasthma, chronic obstructive pulmonary disease, cutaneous lupuserythematosus, scleroderma, vaginitis, proctitis, drug eruptions,leprosy reversal reactions, erythema nodosum leprosum, autoimmuneuveitis, allergic encephalomyelitis, acute necrotizing hemorrhagicencephalopathy, idiopathic bilateral progressive sensorineural hearingloss, aplastic anemia, pure red cell anemia, idiopathicthrombocytopenia, polychondritis, Wegener's granulomatosis, chronicactive hepatitis, Stevens-Johnson syndrome, idiopathic sprue, lichenplanus, Graves' disease, sarcoidosis, primary biliary cirrhosis, uveitisposterior, and interstitial lung fibrosis), graft-versus-host disease,cases of transplantation, and allergy such as, atopic allergy.

[0777] As used herein, disorders involving the heart, or “cardiovasculardisease” or a “cardiovascular disorder” includes a disease or disorderwhich affects the cardiovascular system, e.g., the heart, the bloodvessels, and/or the blood. A cardiovascular disorder can be caused by animbalance in arterial pressure, a malfunction of the heart, or anocclusion of a blood vessel, e.g., by a thrombus. A cardiovasculardisorder includes, but is not limited to disorders such asarteriosclerosis, atherosclerosis, cardiac hypertrophy, ischemiareperfusion injury, restenosis, arterial inflammation, vascular wallremodeling, ventricular remodeling, rapid ventricular pacing, coronarymicroembolism, tachycardia, bradycardia, pressure overload, aorticbending, coronary artery ligation, vascular heart disease, valvulardisease, including but not limited to, valvular degeneration caused bycalcification, rheumatic heart disease, endocarditis, or complicationsof artificial valves; atrial fibrillation, long-QT syndrome, congestiveheart failure, sinus node dysfunction, angina, heart failure,hypertension, atrial fibrillation, atrial flutter, pericardial disease,including but not limited to, pericardial effusion and pericarditis;cardiomyopathies, e.g., dilated cardiomyopathy or idiopathiccardiomyopathy, myocardial infarction, coronary artery disease, coronaryartery spasm, ischemic disease, arrhythmia, sudden cardiac death, andcardiovascular developmental disorders (e.g., arteriovenousmalformations, arteriovenous fistulae, raynaud's syndrome, neurogenicthoracic outlet syndrome, causalgia/reflex sympathetic dystrophy,hemangioma, aneurysm, cavernous angioma, aortic valve stenosis, atrialseptal defects, atrioventricular canal, coarctation of the aorta,ebsteins anomaly, hypoplastic left heart syndrome, interruption of theaortic arch, mitral valve prolapse, ductus arteriosus, patent foramenovale, partial anomalous pulmonary venous return, pulmonary atresia withventricular septal defect, pulmonary atresia without ventricular septaldefect, persistance of the fetal circulation, pulmonary valve stenosis,single ventricle, total anomalous pulmonary venous return, transpositionof the great vessels, tricuspid atresia, truncus arteriosus, ventricularseptal defects). A cardiovascular disease or disorder also can includean endothelial cell disorder.

[0778] As used herein, disorders involving the brain include, but arenot limited to, disorders involving neurons, and disorders involvingglia, such as astrocytes, oligodendrocytes, ependymal cells, andmicroglia; cerebral edema, raised intracranial pressure and herniation,and hydrocephalus; malformations and developmental diseases, such asneural tube defects, forebrain anomalies, posterior fossa anomalies, andsyringomyelia and hydromyelia; perinatal brain injury; cerebrovasculardiseases, such as those related to hypoxia, ischemia, and infarction,including hypotension, hypoperfusion, and low-flow states—globalcerebral ischemia and focal cerebral ischemia—infarction fromobstruction of local blood supply, intracranial hemorrhage, includingintracerebral (intraparenchymal) hemorrhage, subarachnoid hemorrhage andruptured berry aneurysms, and vascular malformations, hypertensivecerebrovascular disease, including lacunar infarcts, slit hemorrhages,and hypertensive encephalopathy; infections, such as acute meningitis,including acute pyogenic (bacterial) meningitis and acute aseptic(viral) meningitis, acute focal suppurative infections, including brainabscess, subdural empyema, and extradural abscess, chronic bacterialmeningoencephalitis, including tuberculosis and mycobacterioses,neurosyphilis, and neuroborreliosis (Lyme disease), viralmeningoencephalitis, including arthropod-borne (Arbo) viralencephalitis, Herpes simplex virus Type 1, Herpes simplex virus Type 2,Varicella-zoster virus (Herpes zoster), cytomegalovirus, poliomyelitis,rabies, and human immunodeficiency virus 1, including HIV-1meningoencephalitis (subacute encephalitis), vacuolar myelopathy,AIDS-associated myopathy, peripheral neuropathy, and AIDS in children,progressive multifocal leukoencephalopathy, subacute sclerosingpanencephalitis, fungal meningoencephalitis, other infectious diseasesof the nervous system; transmissible spongiform encephalopathies (priondiseases); demyelinating diseases, including multiple sclerosis,multiple sclerosis variants, acute disseminated encephalomyelitis andacute necrotizing hemorrhagic encephalomyelitis, and other diseases withdemyelination; degenerative diseases, such as degenerative diseasesaffecting the cerebral cortex, including Alzheimer disease and Pickdisease, degenerative diseases of basal ganglia and brain stem,including Parkinsonism, idiopathic Parkinson disease (paralysisagitans), progressive supranuclear palsy, corticobasal degenration,multiple system atrophy, including striatonigral degenration, Shy-Dragersyndrome, and olivopontocerebellar atrophy, and Huntington disease;spinocerebellar degenerations, including spinocerebellar ataxias,including Friedreich ataxia, and ataxia-telanglectasia, degenerativediseases affecting motor neurons, including amyotrophic lateralsclerosis (motor neuron disease), bulbospinal atrophy (Kennedysyndrome), and spinal muscular atrophy; inborn errors of metabolism,such as leukodystrophies, including Krabbe disease, metachromaticleukodystrophy, adrenoleukodystrophy, Pelizaeus-Merzbacher disease, andCanavan disease, mitochondrial encephalomyopathies, including Leighdisease and other mitochondrial encephalomyopathies; toxic and acquiredmetabolic diseases, including vitamin deficiencies such as thiamine(vitamin B₁) deficiency and vitamin B₁₂ deficiency, neurologic sequelaeof metabolic disturbances, including hypoglycemia, hyperglycemia, andhepatic encephatopathy, toxic disorders, including carbon monoxide,methanol, ethanol, and radiation, including combined methotrexate andradiation-induced injury; tumors, such as gliomas, includingastrocytoma, including fibrillary (diffuse) astrocytoma and glioblastomamultiforme, pilocytic astrocytoma, pleomorphic xanthoastrocytoma, andbrain stem glioma, oligodendroglioma, and ependymoma and relatedparaventricular mass lesions, neuronal tumors, poorly differentiatedneoplasms, including medulloblastoma, other parenchymal tumors,including primary brain lymphoma, germ cell tumors, and pinealparenchymal tumors, meningiomas, metastatic tumors, paraneoplasticsyndromes, peripheral nerve sheath tumors, including schwannoma,neurofibroma, and malignant peripheral nerve sheath tumor (malignantschwannoma), and neurocutaneous syndromes (phakomatoses), includingneurofibromotosis, including Type I neurofibromatosis (NF1) and TYPE 2neurofibromatosis (NF2), tuberous sclerosis, and Von Hippel-Lindaudisease.

[0779] As used herein, skeletal muscle disorders include, but are notlimited to, muscular dystrophy (e.g., Duchenne muscular dystrophy,Becker muscular dystrophy, Emery-Dreifuss muscular dystrophy,limb-girdle muscular dystrophy, facioscapulohumeral muscular dystrophy,myotonic dystrophy, oculopharyngeal muscular dystrophy, distal musculardystrophy, and congenital muscular dystrophy), motor neuron diseases(e.g., amyotrophic lateral sclerosis, infantile progressive spinalmuscular atrophy, intermediate spinal muscular atrophy, spinal bulbarmuscular atrophy, and adult spinal muscular atrophy), myopathies (e.g.,inflammatory myopathies (e.g., dermatomyositis and polymyositis),myotonia congenita, paramyotonia congenita, central core disease,nemaline myopathy, myotubular myopathy, and periodic paralysis), tumorssuch as rhabdomyosarcoma, and metabolic diseases of muscle (e.g.,phosphorylase deficiency, acid maltase deficiency, phosphofructokinasedeficiency, debrancher enzyme deficiency, mitochondrial myopathy,carnitine deficiency, carnitine palmityl transferase deficiency,phosphoglycerate kinase deficiency, phosphoglycerate mutase deficiency,lactate dehydrogenase deficiency, and myoadenylate deaminasedeficiency).

[0780] As used herein, an “endothelial cell disorder” includes adisorder characterized by aberrant, unregulated, or unwanted endothelialcell activity, e.g., proliferation, migration, angiogenesis, orvascularization; or aberrant expression of cell surface adhesionmolecules or genes associated with angiogenesis, e.g., TIE-2, FLT andFLK. Endothelial cell disorders include tumorigenesis, tumor metastasis,psoriasis, diabetic retinopathy, endometriosis, Grave's disease,ischemic disease (e.g., atherosclerosis), and chronic inflammatorydiseases (e.g., rheumatoid arthritis).

[0781] Disorders involving the liver (hepatic disorders) include, butare not limited to, hepatic injury; jaundice and cholestasis, such asbilirubin and bile formation; hepatic failure and cirrhosis, such ascirrhosis, portal hypertension, including ascites, portosystemic shunts,and splenomegaly; infectious disorders, such as viral hepatitis,including hepatitis A-E infection and infection by other hepatitisviruses, clinicopathologic syndromes, such as the carrier state,asymptomatic infection, acute viral hepatitis, chronic viral hepatitis,and fulminant hepatitis; autoimmune hepatitis; drug- and toxin-inducedliver disease, such as alcoholic liver disease; inborn errors ofmetabolism and pediatric liver disease, such as hemochromatosis, Wilsondisease, a₁-antitrypsin deficiency, and neonatal hepatitis; primary bileacid malabsorption; intrahepatic biliary tract disease, such assecondary biliary cirrhosis, primary biliary cirrhosis, primarysclerosing cholangitis, and anomalies of the biliary tree; circulatorydisorders, such as impaired blood flow into the liver, including hepaticartery compromise and portal vein obstruction and thrombosis, impairedblood flow through the liver, including passive congestion andcentrilobular necrosis and peliosis hepatis, hepatic vein outflowobstruction, including hepatic vein thrombosis (Budd-Chiari syndrome)and veno-occlusive disease; hepatic disease associated with pregnancy,such as preeclampsia and eclampsia, acute fatty liver of pregnancy, andintrehepatic cholestasis of pregnancy; hepatic complications of organ orbone marrow transplantation, such as drug toxicity after bone marrowtransplantation, graft-versus-host disease and liver rejection, andnonimmunologic damage to liver allografts; tumors and tumorousconditions, such as nodular hyperplasias, adenomas, and malignanttumors, including primary carcinoma of the liver and metastatic tumors.

[0782] Disorders which can be treated or diagnosed by methods describedherein include, but are not limited to, disorders associated with anaccumulation in the liver of fibrous tissue, such as that resulting froman imbalance between production and degradation of the extracellularmatrix accompanied by the collapse and condensation of preexistingfibers. The methods described herein can be used to diagnose or treathepatocellular necrosis or injury induced by a wide variety of agentsincluding processes which disturb homeostasis, such as an inflammatoryprocess, tissue damage resulting from toxic injury or altered hepaticblood flow, and infections (e.g., bacterial, viral and parasitic). Forexample, the methods can be used for the early detection of hepaticinjury, such as portal hypertension or hepatic fibrosis. In addition,the methods can be employed to detect liver fibrosis attributed toinborn errors of metabolism, for example, fibrosis resulting from astorage disorder such as Gaucher's disease (lipid abnormalities) or aglycogen storage disease, A1-antitrypsin deficiency; a disordermediating the accumulation (e.g., storage) of an exogenous substance,for example, hemochromatosis (iron-overload syndrome) and copper storagediseases (Wilson's disease), disorders resulting in the accumulation ofa toxic metabolite (e.g. tyrosinemia, fructosemia and galactosemia) andperoxisomal disorders (e.g., Zellweger syndrome). Additionally, themethods described herein can be used for the early detection andtreatment of liver injury associated with the administration of variouschemicals or drugs, such as for example, methotrexate, isonizaid,oxyphenisatin, methyldopa, chlorpromazine, tolbutamide or alcohol, orwhich represents a hepatic manifestation of a vascular disorder such asobstruction of either the intrahepatic or extrahepatic bile flow or analteration in hepatic circulation resulting, for example, from chronicheart failure, veno-occlusive disease, portal vein thrombosis orBudd-Chiari syndrome.

[0783] Additionally, the molecules of the invention can play animportant role in the etiology of certain viral diseases, including butnot limited to Hepatitis B, Hepatitis C and Herpes Simplex Virus (HSV).Modulators of the activity of the molecules of the invention could beused to control viral diseases. The modulators can be used in thetreatment and/or diagnosis of viral infected tissue or virus-associatedtissue fibrosis, especially liver and liver fibrosis. Also, suchmodulators can be used in the treatment and/or diagnosis ofvirus-associated carcinoma, especially hepatocellular cancer.

[0784] Disorders related to reduced platelet number, thrombocytopenia,include idiopathic thrombocytopenic purpura, including acute idiopathicthrombocytopenic purpura, drug-induced thrombocytopenia, {v-associatedthrombocytopenia, and thrombotic microangiopathies: thromboticthrombocytopenic purpura and hemolytic-uremic syndrome.

[0785] As used herein, neurological disorders include disorders of thecentral nervous system (CNS) and the peripheral nervous system, e.g.,cognitive and neurodegenerative disorders, Examples of neurologicaldisorders include, but are not limited to, autonomic function disorderssuch as hypertension and sleep disorders, and neuropsychiatricdisorders, such as depression, schizophrenia, schizoaffective disorder,Korsakoff's psychosis, alcoholism, anxiety disorders, or phobicdisorders; learning or memory disorders, e.g., amnesia or age-relatedmemory loss, attention deficit disorder, dysthymic disorder, majordepressive disorder, mania, obsessive-compulsive disorder, psychoactivesubstance use disorders, anxiety, phobias, panic disorder, as well asbipolar affective disorder, e.g., severe bipolar affective (mood)disorder (BP-1), and bipolar affective neurological disorders, e.g.,migraine and obesity. Such neurological disorders include, for example,disorders involving neurons, and disorders involving glia, such asastrocytes, oligodendrocytes, ependymal cells, and microglia; cerebraledema, raised intracranial pressure and herniation, and hydrocephalus;malformations and developmental diseases, such as neural tube defects,forebrain anomalies, posterior fossa anomalies, and syringomyelia andhydromyelia; perinatal brain injury; cerebrovascular diseases, such asthose related to hypoxia, ischemia, and infarction, includinghypotension, hypoperfusion, and low-flow states—global cerebral ischemiaand focal cerebral ischemia—infarction from obstruction of local bloodsupply, intracranial hemorrhage, including intracerebral(intraparenchymal) hemorrhage, subarachnoid hemorrhage and rupturedberry aneurysms, and vascular malformations, hypertensivecerebrovascular disease, including lacunar infarcts, slit hemorrhages,and hypertensive encephalopathy; infections, such as acute meningitis,including acute pyogenic (bacterial) meningitis and acute aseptic(viral) meningitis, acute focal suppurative infections, including brainabscess, subdural empyema, and extradural abscess, chronic bacterialmeningoencephalitis, including tuberculosis and mycobacterioses,neurosyphilis, and neuroborreliosis (Lyme disease), viralmeningoencephalitis, including arthropod-borne (Arbo) viralencephalitis, Herpes simplex virus Type 1, Herpes simplex virus Type 2,Varicella-zoster virus (Herpes zoster), cytomegalovirus, poliomyelitis,rabies, and human immunodeficiency virus 1, including HIV-1meningoencephalitis (subacute encephalitis), vacuolar myelopathy,AIDS-associated myopathy, peripheral neuropathy, and AIDS in children,progressive multifocal leukoencephalopathy, subacute sclerosingpanencephalitis, fungal meningoencephalitis, other infectious diseasesof the nervous system; transmissible spongiform encephalopathies (priondiseases); demyelinating diseases, including multiple sclerosis,multiple sclerosis variants, acute disseminated encephalomyelitis andacute necrotizing hemorrhagic encephalomyelitis, and other diseases withdemyelination; degenerative diseases, such as degenerative diseasesaffecting the cerebral cortex, including Alzheimer's disease and Pick'sdisease, degenerative diseases of basal ganglia and brain stem,including Parkinsonism, idiopathic Parkinson's disease (paralysisagitans) and other Lewy diffuse body diseases, progressive supranuclearpalsy, corticobasal degenration, multiple system atrophy, includingstriatonigral degenration, Shy-Drager syndrome, and olivopontocerebellaratrophy, and Huntington's disease, senile dementia, Gilles de laTourette's syndrome, epilepsy, and Jakob-Creutzfieldt disease;spinocerebellar degenerations, including spinocerebellar ataxias,including Friedreich ataxia, and ataxia-telanglectasia, degenerativediseases affecting motor neurons, including amyotrophic lateralsclerosis (motor neuron disease), bulbospinal atrophy (Kennedysyndrome), and spinal muscular atrophy; inborn errors of metabolism,such as leukodystrophies, including Krabbe disease, metachromaticleukodystrophy, adrenoleukodystrophy, Pelizaeus-Merzbacher disease, andCanavan disease, mitochondrial encephalomyopathies, including Leighdisease and other mitochondrial encephalomyopathies; toxic and acquiredmetabolic diseases, including vitamin deficiencies such as thiamine(vitamin B₁) deficiency and vitamin B₁₂ deficiency, neurologic sequelaeof metabolic disturbances, including hypoglycemia, hyperglycemia, andhepatic encephatopathy, toxic disorders, including carbon monoxide,methanol, ethanol, and radiation, including combined methotrexate andradiation-induced injury; tumors, such as gliomas, includingastrocytoma, including fibrillary (diffuse) astrocytoma and glioblastomamultiforme, pilocytic astrocytoma, pleomorphic xanthoastrocytoma, andbrain stem glioma, oligodendroglioma, and ependymoma and relatedparaventricular mass lesions, neuronal tumors, poorly differentiatedneoplasms, including medulloblastoma, other parenchymal tumors,including primary brain lymphoma, germ cell tumors, and pinealparenchymal tumors, meningiomas, metastatic tumors, paraneoplasticsyndromes, peripheral nerve sheath tumors, including schwannoma,neurofibroma, and malignant peripheral nerve sheath tumor (malignantschwannoma), and neurocutaneous syndromes (phakomatoses), includingneurofibromotosis, including Type 1 neurofibromatosis (NF1) and TYPE 2neurofibromatosis (NF2), tuberous sclerosis, and Von Hippel-Lindaudisease. Further CNS-related disorders include, for example, thoselisted in the American Psychiatric Association's Diagnostic andStatistical manual of Mental Disorders (DSM), the most current versionof which is incorporated herein by reference in its entirety.

[0786] As used herein, diseases of the skin (dermal disorders), includebut are not limited to, disorders of pigmentation and melanocytes,including but not limited to, vitiligo, freckle, melasma, lentigo,nevocellular nevus, dysplastic nevi, and malignant melanoma; benignepithelial tumors, including but not limited to, seborrheic keratoses,acanthosis nigricans, fibroepithelial polyp, epithelial cyst,keratoacanthoma, and adnexal (appendage) tumors; premalignant andmalignant epidermal tumors, including but not limited to, actinickeratosis, squamous cell carcinoma, basal cell carcinoma, and merkelcell carcinoma; tumors of the dermis, including but not limited to,benign fibrous histiocytoma, dermatofibrosarcoma protuberans, xanthomas,and dermal vascular tumors; tumors of cellular immigrants to the skin,including but not limited to, histiocytosis X, mycosis fungoides(cutaneous T-cell lymphoma), and mastocytosis; disorders of epidermalmaturation, including but not limited to, ichthyosis; acute inflammatorydermatoses, including but not limited to, urticaria, acute eczematousdermatitis, and erythema multiforme; chronic inflammatory dermatoses,including but not limited to, psoriasis, lichen planus, and lupuserythematosus; blistering (bullous) diseases, including but not limitedto, pemphigus, bullous pemphigoid, dermatitis herpetiformis, andnoninflammatory blistering diseases: epidermolysis bullosa andporphyria; disorders of epidermal appendages, including but not limitedto, acne vulgaris; panniculitis, including but not limited to, erythemanodosum and erythema induratum; and infection and infestation, such asverrucae, molluscum contagiosum, impetigo, superficial fungalinfections, and arthropod bites, stings, and infestations.

[0787] Additionally, molecules of the invention can play an importantrole in the regulation of metabolism or pain disorders. Diseases ofmetabolic imbalance include, but are not limited to, obesity, anorexianervosa, cachexia, lipid disorders, and diabetes. Examples of paindisorders include, but are not limited to, pain response elicited duringvarious forms of tissue injury, e.g., inflammation, infection, andischemia, usually referred to as hyperalgesia (described in, forexample, Fields (1987) Pain, New York:McGraw-Hill); pain associated withmusculoskeletal disorders, e.g., joint pain; tooth pain; headaches; painassociated with surgery; pain related to irritable bowel syndrome; orchest pain.

[0788] As used herein, the term “erythroid associated disorders” includedisorders involving aberrant (increased or deficient) erythroblastproliferation, e.g., an erythroleukemia, and aberrant (increased ordeficient) erythroblast differentiation, e.g., an anemia.Erythrocyte-associated disorders include anemias such as, for example,drug- (chemotherapy-) induced anemias, hemolytic anemias due tohereditary cell membrane abnormalities, such as hereditaryspherocytosis, hereditary elliptocytosis, and hereditarypyropoikilocytosis; hemolytic anemias due to acquired cell membranedefects, such as paroxysmal nocturnal hemoglobinuria and spur cellanemia; hemolytic anemias caused by antibody reactions, for example tothe RBC antigens, or antigens of the ABO system, Lewis system, Iisystem, Rh system, Kidd system, Duffy system, and Kell system;methemoglobinemia; a failure of erythropoiesis, for example, as a resultof aplastic anemia, pure red cell aplasia, myelodysplastic syndromes,sideroblastic anemias, and congenital dyserythropoietic anemia;secondary anemia in non-hematolic disorders, for example, as a result ofchemotherapy, alcoholism, or liver disease; anemia of chronic disease,such as chronic renal failure; and endocrine deficiency diseases.Another example of an erythroid-associated disorder is erythrocytosis.Erythrocytosis, a disorder of red blood cell overproduction caused byexcessive and/or ectopic erythropoietin production, can be caused bycancers, e.g., a renal cell cancer, a hepatocarcinoma, and a centralnervous system cancer. Diseases associated with erythrocytosis includepolycythemias, e.g., polycythemia vera, secondary polycythemia, andrelative polycythemia.

[0789] As used herein, an “angiogenesis disorder” includes a disease ordisorder which affects or is caused by aberrant or deficientangiogenesis. Disorders involving angiogenesis include, but are notlimited to, aberrant or excess angiogenesis in tumors such ashemangiomas and Kaposi's sarcoma, von Hippel-Lindau disease, as well asthe angiogenesis associated with tumor growth; aberrant or excessangiogenesis in diseases such as a Castleman's disease or fibrodysplasiaossificans progressiva; aberrant or deficient angiogenesis associatedwith aging, complications of healing certain wounds and complications ofdiseases such as diabetes and rheumatoid arthritis; or aberrant ordeficient angiogenesis associated with hereditary hemorrhagictelangiectasia, autosomal dominant polycystic kidney disease,myelodysplastic syndrome or Klippel-Trenaunay-Weber syndrome.

[0790] As used herein, disorders involving the spleen include, but arenot limited to, splenomegaly, including nonspecific acute splenitis,congestive spenomegaly, and spenic infarcts; neoplasms, congenitalanomalies, and rupture. Disorders associated with splenomegaly includeinfections, such as nonspecific splenitis, infectious mononucleosis,tuberculosis, typhoid fever, brucellosis, cytomegalovirus, syphilis,malaria, histoplasmosis, toxoplasmosis, kala-azar, trypanosomiasis,schistosomiasis, leishmaniasis, and echinococcosis; congestive statesrelated to partial hypertension, such as cirrhosis of the liver, portalor splenic vein thrombosis, and cardiac failure; lymphohematogenousdisorders, such as Hodgkin disease, non-Hodgkin lymphomas/leukemia,multiple myeloma, myeloproliferative disorders, hemolytic anemias, andthrombocytopenic purpura; immunologic-inflammatory conditions, such asrheumatoid arthritis and systemic lupus erythematosus; storage diseasessuch as Gaucher disease, Niemann-Pick disease, andmucopolysaccharidoses; and other conditions, such as amyloidosis,primary neoplasms and cysts, and secondary neoplasms.

[0791] As used herein, disorders involving blood vessels include, butare not limited to, responses of vascular cell walls to injury, such asendothelial dysfunction and endothelial activation and intimalthickening; vascular diseases including, but not limited to, congenitalanomalies, such as arteriovenous fistula, atherosclerosis, andhypertensive vascular disease, such as hypertension; inflammatorydisease—the vasculitides, such as giant cell (temporal) arteritis,Takayasu arteritis, polyarteritis nodosa (classic), Kawasaki syndrome(mucocutaneous lymph node syndrome), microscopic polyanglitis(microscopic polyarteritis, hypersensitivity or leukocytoclasticanglitis), Wegener granulomatosis, thromboanglitis obliterans (Buergerdisease), vasculitis associated with other disorders, and infectiousarteritis; Raynaud disease; aneurysms and dissection, such as abdominalaortic aneurysms, syphilitic (luetic) aneurysms, and aortic dissection(dissecting hematoma); disorders of veins and lymphatics, such asvaricose veins, thrombophlebitis and phlebothrombosis, obstruction ofsuperior vena cava (superior vena cava syndrome), obstruction ofinferior vena cava (inferior vena cava syndrome), and lymphangitis andlymphedema; tumors, including benign tumors and tumor-like conditions,such as hemangioma, lymphangioma, glomus tumor (glomangioma), vascularectasias, and bacillary angiomatosis, and intermediate-grade (borderlinelow-grade malignant) tumors, such as Kaposi sarcoma andhemangloendothelioma, and malignant tumors, such as angiosarcoma andhemangiopericytoma; and pathology of therapeutic interventions invascular disease, such as balloon angioplasty and related techniques andvascular replacement, such as coronary artery bypass graft surgery.

[0792] As used herein, disorders involving the testis and epididymisinclude, but are not limited to, congenital anomalies such ascryptorchidism, regressive changes such as atrophy, inflammations suchas nonspecific epididymitis and orchitis, granulomatous (autoimmune)orchitis, and specific inflammations including, but not limited to,gonorrhea, mumps, tuberculosis, and syphilis, vascular disturbancesincluding torsion, testicular tumors including germ cell tumors thatinclude, but are not limited to, seminoma, spermatocytic seminoma,embryonal carcinoma, yolk sac tumor choriocarcinoma, teratoma, and mixedtumors, tumore of sex cord-gonadal stroma including, but not limited to,Leydig (interstitial) cell tumors and sertoli cell tumors(androblastoma), and testicular lymphoma, and miscellaneous lesions oftunica vaginalis.

[0793] As used herein, disorders involving the thymus includedevelopmental disorders, such as DiGeorge syndrome with thymichypoplasia or aplasia; thymic cysts; thymic hypoplasia, which involvesthe appearance of lymphoid follicles within the thymus, creating thymicfollicular hyperplasia; and thymomas, including germ cell tumors,lynphomas, Hodgkin disease, and carcinoids. Thymomas can include benignor encapsulated thymoma, and malignant thymoma Type I (invasive thymoma)or Type II, designated thymic carcinoma.

[0794] As used herein, disorders involving the thyroid include, but arenot limited to, hyperthyroidism; hypothyroidism including, but notlimited to, cretinism and myxedema; thyroiditis including, but notlimited to, hashimoto thyroiditis, subacute (granulomatous) thyroiditis,and subacute lymphocytic (painless) thyroiditis; Graves disease; diffuseand multinodular goiter including, but not limited to, diffuse nontoxic(simple) goiter and multinodular goiter; neoplasms of the thyroidincluding, but not limited to, adenomas, other benign tumors, andcarcinomas, which include, but are not limited to, papillary carcinoma,follicular carcinoma, medullary carcinoma, and anaplastic carcinoma; andcogenital anomalies.

[0795] As used herein, disorders related to reduced platelet number,thrombocytopenia, include idiopathic thrombocytopenic purpura, includingacute idiopathic thrombocytopenic purpura, drug-inducedthrombocytopenia, HIV-associated thrombocytopenia, and thromboticmicroangiopathies: thrombotic thrombocytopenic purpura andhemolytic-uremic syndrome.

[0796] Various aspects of the invention are described in further detailbelow.

[0797] Isolated Nucleic Acid Molecules

[0798] In one aspect, the invention provides, an isolated or purified,nucleic acid molecule that encodes a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptide described herein, e.g., afull length 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein or a fragment thereof, e.g., a biologically active portionof 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein. Also included is a nucleic acid fragment suitable for use as ahybridization probe, which can be used, e.g., to identify a nucleic acidmolecule encoding a polypeptide of the invention, 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA, and fragmentssuitable for use as primers, e.g., PCR primers for the amplification ormutation of nucleic acid molecules.

[0799] In one embodiment, an isolated nucleic acid molecule of theinvention includes the nucleotide sequence shown in SEQ ID NO:1, 3, 5,7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49,51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109,111 or 113, or a portion of any of this nucleotide sequence. In oneembodiment, the nucleic acid molecule includes sequences encoding thehuman 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein (i.e., “the coding region” of SEQ ID NO:1, 5, 10, 18, 21,24, 31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111, asshown in SEQ ID NO:3, 7, 12, 20, 23, 26, 33, 41, 45, 48, 51, 56, 59, 65,68, 73, 90, 106, 109 or 113, respectively), as well as 5′ untranslatedsequences and 3′ untranslated sequences. Alternatively, the nucleic acidmolecule can include only the coding region of SEQ ID NO:1, 5, 10, 18,21, 24, 31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 11(e.g., SEQ ID NO:3, 7, 12, 20, 23, 26, 33, 41, 45, 48, 51, 56, 59, 65,68, 73, 90, 106, 109 or 113) and, e.g., no flanking sequences whichnormally accompany the subject sequence. In another embodiment, thenucleic acid molecule encodes a sequence corresponding to a fragment ofthe protein corresponding to domains within SEQ ID NO:2, 6, 11, 19, 22,25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112.

[0800] In another embodiment, an isolated nucleic acid molecule of theinvention includes a nucleic acid molecule which is a complement of thenucleotide sequence shown in SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21,23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63,65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113, or a portionof any of these nucleotide sequences. In other embodiments, the nucleicacid molecule of the invention is sufficiently complementary to thenucleotide sequence shown in SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21,23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63,65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113 such that itcan hybridize to the nucleotide sequence shown in SEQ ID NO:1, 3, 5, 7,10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51,54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111or 113, thereby forming a stable duplex.

[0801] In one embodiment, an isolated nucleic acid molecule of thepresent invention includes a nucleotide sequence which is at leastabout: 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,97%, 98%, 99%, or more homologous to the entire length of the nucleotidesequence shown in SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26,31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68,71, 73, 88, 90, 104, 106, 107, 109, 111 or 113, or a portion, preferablyof the same length, of any of these nucleotide sequences.

[0802] 21910, 56634.55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 Nucleic Acid Fragments

[0803] A nucleic acid molecule of the invention can include only aportion of the nucleic acid sequence of SEQ ID NO:1, 3, 5, 7, 10, 12,18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56,57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113.For example, such a nucleic acid molecule can include a fragment whichcan be used as a probe or primer or a fragment encoding a portion of a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein, e.g., an immunogenic or biologically active portion of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein. Afragment can comprise those nucleotides of SEQ ID NO:1, 3, 5, 7, 10, 12,18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56,57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113,which encode a domain of human 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593. The nucleotide sequence determined from thecloning of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene allows for the generation of probes and primersdesigned for use in identifying and/or cloning other 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 family members,or fragments thereof, as well as 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 homologs, or fragments thereof, fromother species.

[0804] In another embodiment, a nucleic acid includes a nucleotidesequence that includes part, or all, of the coding region and extendsinto either (or both) the 5′ or 3′ noncoding region. Other embodimentsinclude a fragment which includes a nucleotide sequence encoding anamino acid fragment described herein. Nucleic acid fragments can encodea specific domain or site described herein or fragments thereof,particularly fragments thereof which are at least 100 amino acids inlength. Fragments also include nucleic acid sequences corresponding tospecific amino acid sequences described above or fragments thereof.Nucleic acid fragments should not to be construed as encompassing thosefragments that may have been disclosed prior to the invention.

[0805] A nucleic acid fragment can include a sequence corresponding to adomain, region, or functional site described herein. A nucleic acidfragment can also include one or more domain, region, or functional sitedescribed herein. Thus, for example, a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 nucleic acid fragment can include asequence corresponding to a domain, as described herein.

[0806] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 probes and primers are provided. Typically a probe/primer is anisolated or purified oligonucleotide. The oligonucleotide typicallyincludes a region of nucleotide sequence that hybridizes under stringentconditions to at least about 7, 12 or 15, preferably about 20 or 25,more preferably about 30, 35, 40, 45, 50, 55, 60, 65, or 75 consecutivenucleotides of a sense or antisense sequence of SEQ ID NO:1, 3, 5, 7,10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51,54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111or 113, or of a naturally occurring allelic variant or mutant of SEQ IDNO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45,46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104,106, 107, 109, 111 or 113.

[0807] In a preferred embodiment the nucleic acid is a probe which is atleast 5 or 10, and less than 200, more preferably less than 100, or lessthan 50, base pairs in length. It should be identical, or differ by 1,or less than in 5 or 10 bases, from a sequence disclosed herein. Ifalignment is needed for this comparison the sequences should be alignedfor maximum homology. “Looped” out sequences from deletions orinsertions, or mismatches, are considered differences.

[0808] A probe or primer can be derived from the sense or anti-sensestrand of a nucleic acid which encodes a domain identified in the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequences.

[0809] In another embodiment a set of primers is provided, e.g., primerssuitable for use in a PCR, which can be used to amplify a selectedregion of a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 sequence, e.g., a domain, region, site or other sequence describedherein. The primers should be at least 5, 10, or 50 base pairs in lengthand less than 100, or less than 200, base pairs in length. The primersshould be identical, or differ by one base from a sequence disclosedherein or from a naturally occurring variant.

[0810] A nucleic acid fragment can encode an epitope bearing region of apolypeptide described herein.

[0811] A nucleic acid fragment encoding a “biologically active portionof a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide” can be prepared by isolating a portion of the nucleotidesequence of SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31,33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71,73, 88, 90, 104, 106, 107, 109, 111 or 113, which encodes a polypeptidehaving a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 biological activity (e.g., the biological activities of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteinsare described herein), expressing the encoded portion of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein(e.g., by recombinant expression in vitro) and assessing the activity ofthe encoded portion of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein. A nucleic acid fragment encoding abiologically active portion of a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptide, can comprise a nucleotidesequence which is greater than 300 or more nucleotides in length.

[0812] In preferred embodiments, a nucleic acid includes a nucleotidesequence which is about 300, 400, 500, 600, 700, 800, 900, 1000, 1100,1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300,2400, 2500, 2600, 2700, 2800, 2900, 3000, 3100, 3200, 3300, 3400, 3500,3600, 3700, 3800, 3900, 4000, 4100, 4200, 4300, 4400, 4500, 4600, 4700,4800, 4900, 5000, 5100, 5200, 5300, 5400, 5500, 5600, 5700, 5800, 5900,6000, 6100, 6200, 6300, 6400, 6500, 6600, 6700, 6800, 6900, 7000, 7100,7200, 7300 or more nucleotides in length and hybridizes under stringenthybridization conditions to a nucleic acid molecule of SEQ ID NO:1, 3,5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48,49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107,109, 111 or 113.

[0813] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343. 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 Nucleic Acid Variants

[0814] The invention further encompasses nucleic acid molecules thatdiffer from the nucleotide sequence shown in SEQ ID NO:1, 3, 5, 7, 10,12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54,56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or113. Such differences can be due to degeneracy of the genetic code (andresult in a nucleic acid which encodes the same 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins as thoseencoded by the nucleotide sequence disclosed herein. In anotherembodiment, an isolated nucleic acid molecule of the invention has anucleotide sequence encoding a protein having an amino acid sequencewhich differs, by at least 1, but less than 5, 10, 20, 50, or 100 aminoacid residues that shown in SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44,47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112. If alignment is neededfor this comparison the sequences should be aligned for maximumhomology. “Looped” out sequences from deletions or insertions, ormismatches, are considered differences.

[0815] Nucleic acids of the inventor can be chosen for having codons,which are preferred, or non-preferred, for a particular expressionsystem. E.g., the nucleic acid can be one in which at least one codon,at preferably at least 10%, or 20% of the codons has been altered suchthat the sequence is optimized for expression in E. coli, yeast, human,insect, or CHO cells.

[0816] Nucleic acid variants can be naturally occurring, such as allelicvariants (same locus), homologs (different locus), and orthologs(different organism) or can be non naturally occurring. Non-naturallyoccurring variants can be made by mutagenesis techniques, includingthose applied to polynucleotides, cells, or organisms. The variants cancontain nucleotide substitutions, deletions, inversions and insertions.Variation can occur in either or both the coding and non-coding regions.The variations can produce both conservative and non-conservative aminoacid substitutions (as compared in the encoded product).

[0817] In a preferred embodiment, the nucleic acid differs from that ofSEQ ID NO: 1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41,43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90,104, 106, 107, 109, 111 or 113, e.g., as follows: by at least one butless than 10, 20, 30, or 40 nucleotides; at least one but less than 1%,5%, 10% or 20% of the nucleotides in the subject nucleic acid. Ifnecessary for this analysis the sequences should be aligned for maximumhomology. “Looped” out sequences from deletions or insertions, ormismatches, are considered differences.

[0818] Orthologs, homologs, and allelic variants can be identified usingmethods known in the art. These variants comprise a nucleotide sequenceencoding a polypeptide that is 50%, at least about 55%, typically atleast about 70-75%, more typically at least about 80-85%, and mosttypically at least about 90-95% or more identical to the nucleotidesequence shown in SEQ ID NO: 1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26,31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68,71, 73, 88, 90, 104, 106, 107, 109, 111 or 113 or a fragment of thissequence. Such nucleic acid molecules can readily be identified as beingable to hybridize under stringent conditions, to the nucleotide sequenceshown in SEQ ID NO: 1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33,39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73,88, 90, 104, 106, 107, 109, 111 or 113 or a fragment of the sequence.Nucleic acid molecules corresponding to orthologs, homologs, and allelicvariants of the 21910, 56634, 33217, 21967, h1983, m1983, 38555 or 593cDNAs of the invention can further be isolated by mapping to the samechromosome or locus as the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 gene.

[0819] Preferred variants include those that are correlated withactivities specific to the molecules of the invention, i.e. guanylatekinase activity, phophatidylinositol 4-phosphate 5-kinase activity,kinase activity, transferase activity, aminopeptidase activity,adenylate cyclase activity, calpain protease activity, oxidoreductaseactivity, neprilysin protease activity, AMP binding enzyme activity andlysyl oxidase activity, or other activity.

[0820] Allelic variants of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593, e.g., human 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593, include both functional andnon-functional proteins. Functional allelic variants are naturallyoccurring amino acid sequence variants of the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein within a populationthat maintain the ability to bind a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 ligand or substrate and/or modulatecell proliferation and/or migration mechanisms. Functional allelicvariants will typically contain only conservative substitution of one ormore amino acids of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50,55, 58, 64, 67, 72, 89, 105, 108 or 112, or substitution, deletion orinsertion of non-critical residues in non-critical regions of theprotein. Non-functional allelic variants are naturally-occurring aminoacid sequence variants of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593, e.g., human 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593, protein within a populationthat do not have the ability to bind a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 ligand or substrate and/or modulatecell proliferation and/or migration mechanisms. Non-functional allelicvariants will typically contain a non-conservative substitution, adeletion, or insertion, or premature truncation of the amino acidsequence of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58,64, 67, 72, 89, 105, 108 or 112, or a substitution, insertion, ordeletion in critical residues or critical regions of the protein.

[0821] Moreover, nucleic acid molecules encoding other 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 family membersand, thus, which have a nucleotide sequence which differs from the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593sequences of SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31,33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71,73, 88, 90, 104, 106, 107, 109, 111 or 113 are intended to be within thescope of the invention.

[0822] Antisense Nucleic Acid Molecules, Ribozymes and Modified 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 NucleicAcid Molecules

[0823] In another aspect, the invention features, an isolated nucleicacid molecule which is antisense to 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593. An “antisense” nucleic acid caninclude a nucleotide sequence which is complementary to a “sense”nucleic acid encoding a protein, e.g., complementary to the codingstrand of a double-stranded cDNA molecule or complementary to an mRNAsequence. The antisense nucleic acid can be complementary to an entire21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593coding strand, or to only a portion thereof (e.g., the coding region ofhuman 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 corresponding to SEQ ID NO:3, 7, 12, 20, 23, 26, 33, 41, 45, 48, 51,56, 59, 65, 68, 73, 90, 106, 109 or 113, respectively). In anotherembodiment, the antisense nucleic acid molecule is antisense to a“noncoding region” of the coding strand of a nucleotide sequenceencoding 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 (e.g., the 5′ and 3′ untranslated regions).

[0824] An antisense nucleic acid can be designed such that it iscomplementary to the entire coding region of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 mRNA, but more preferably is anoligonucleotide which is antisense to only a portion of the coding ornoncoding region of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 mRNA. For example, the antisense oligonucleotide canbe complementary to the region surrounding the translation start site of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593mRNA, e.g., between the −10 and +10 regions of the target genenucleotide sequence of interest. An antisense oligonucleotide can be,for example, about 7, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65,70, 75, 80, or more nucleotides in length.

[0825] An antisense nucleic acid of the invention can be constructedusing chemical synthesis and enzymatic ligation reactions usingprocedures known in the art. For example, an antisense nucleic acid(e.g., an antisense oligonucleotide) can be chemically synthesized usingnaturally occurring nucleotides or variously modified nucleotidesdesigned to increase the biological stability of the molecules or toincrease the physical stability of the duplex formed between theantisense and sense nucleic acids, e.g., phosphorothioate derivativesand acridine substituted nucleotides can be used. The antisense nucleicacid also can be produced biologically using an expression vector intowhich a nucleic acid has been subcloned in an antisense orientation(i.e., RNA transcribed from the inserted nucleic acid will be of anantisense orientation to a target nucleic acid of interest, describedfurther in the following subsection).

[0826] The antisense nucleic acid molecules of the invention aretypically administered to a subject (e.g., by direct injection at atissue site), or generated in situ such that they hybridize with or bindto cellular mRNA and/or genomic DNA encoding a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein to therebyinhibit expression of the protein, e.g., by inhibiting transcriptionand/or translation. Alternatively, antisense nucleic acid molecules canbe modified to target selected cells and then administered systemically.For systemic administration, antisense molecules can be modified suchthat they specifically or selectively bind to receptors or antigensexpressed on a selected cell surface, e.g., by linking the antisensenucleic acid molecules to peptides or antibodies which bind to cellsurface receptors or antigens. The antisense nucleic acid molecules canalso be delivered to cells using the vectors described herein. Toachieve sufficient intracellular concentrations of the antisensemolecules, vector constructs in which the antisense nucleic acidmolecule is placed under the control of a strong pol II or pol IIIpromoter are preferred.

[0827] In yet another embodiment, the antisense nucleic acid molecule ofthe invention is an α-anomeric nucleic acid molecule. An α-anomericnucleic acid molecule forms specific double-stranded hybrids withcomplementary RNA in which, contrary to the usual β-units, the strandsrun parallel to each other (Gaultier et al. (1987) Nucleic Acids. Res.15:6625-6641). The antisense nucleic acid molecule can also comprise a2′-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res.15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBSLett. 215:327-330).

[0828] In still another embodiment, an antisense nucleic acid of theinvention is a ribozyme. A ribozyme having specificity for a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-encodingnucleic acid can include one or more sequences complementary to thenucleotide sequence of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 cDNA disclosed herein (i.e., SEQ ID NO:1, 3, 5, 7,10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51,54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111or 113), and a sequence having known catalytic sequence responsible formRNA cleavage (see U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach(1988) Nature 334:585591). For example, a derivative of a TetrahymenaL-19 IVS RNA can be constructed in which the nucleotide sequence of theactive site is complementary to the nucleotide sequence to be cleaved ina 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071; andCech et al. U.S. Pat. No. 5,116,742. Alternatively, 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA can be used toselect a catalytic RNA having a specific ribonuclease activity from apool of RNA molecules. See, e.g., Bartel and Szostak (1993) Science261:1411-1418.

[0829] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene expression can be inhibited by targeting nucleotide sequencescomplementary to the regulatory region of the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 (e.g., the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 promoter and/orenhancers) to form triple helical structures that prevent transcriptionof the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene in target cells. See generally, Helene (1991) Anticancer DrugDes. 6:569-84; Helene (1992) Ann. N.Y. Acad. Sci. 660:27-36; and Maher(1992) Bioassays 14:807-15. The potential sequences that can be targetedfor triple helix formation can be increased by creating a so-called“switchback” nucleic acid molecule. Switchback molecules are synthesizedin an alternating 5′-3′,3′-5′ manner, such that they base pair withfirst one strand of a duplex and then the other, eliminating thenecessity for a sizeable stretch of either purines or pyrimidines to bepresent on one strand of a duplex.

[0830] The invention also provides detectably labeled oligonucleotideprimer and probe molecules. Typically, such labels are chemiluminescent,fluorescent, radioactive, or colorimetric.

[0831] A 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleic acid molecule can be modified at the base moiety, sugarmoiety or phosphate backbone to improve, e.g., the stability,hybridization, or solubility of the molecule. For example, thedeoxyribose phosphate backbone of the nucleic acid molecules can bemodified to generate peptide nucleic acids (see Hyrup et al. (1996)Bioorganic & Medicinal Chemistry 4: 5-23).

[0832] As used herein, the terms “peptide nucleic acid” or “PNA” refersto a nucleic acid mimic, e.g., a DNA mimic, in which the deoxyribosephosphate backbone is replaced by a pseudopeptide backbone and only thefour natural nucleobases are retained. The neutral backbone of a PNA canallow for specific hybridization to DNA and RNA under conditions of lowionic strength. The synthesis of PNA oligomers can be performed usingstandard solid phase peptide synthesis protocols as described in Hyrupet al. (1996) supra; Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci.93: 14670-675.

[0833] PNAs of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 nucleic acid molecules can be used in therapeutic anddiagnostic applications. For example, PNAs can be used as antisense orantigene agents for sequence-specific modulation of gene expression by,for example, inducing transcription or translation arrest or inhibitingreplication. PNAs of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 nucleic acid molecules can also be used in theanalysis of single base pair mutations in a gene, (e.g., by PNA-directedPCR clamping); as ‘artificial restriction enzymes’ when used incombination with other enzymes, (e.g., S1 nucleases (Hyrup et al. (1996)supra)); or as probes or primers for DNA sequencing or hybridization(Hyrup et al. (1996) supra; Perry-O'Keefe supra).

[0834] In other embodiments, the oligonucleotide can include otherappended groups such as peptides (e.g., for targeting host cellreceptors in vivo), or agents facilitating transport across the cellmembrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA84:648-652; PCT Publication No. WO88/09810) or the blood-brain barrier(see, e.g., PCT Publication No. WO89/10134). In addition,oligonucleotides can be modified with hybridization-triggered cleavageagents (see, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) orintercalating agents. (see, e.g., Zon (1988) Pharm. Res. 5:539-549). Tothis end, the oligonucleotide can be conjugated to another molecule,(e.g., a peptide, hybridization triggered cross-linking agent, transportagent, or hybridization-triggered cleavage agent).

[0835] The invention also includes molecular beacon oligonucleotideprimer and probe molecules having at least one region which iscomplementary to a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 nucleic acid of the invention, two complementaryregions one having a fluorophore and one a quencher such that themolecular beacon is useful for quantitating the presence of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleicacid of the invention in a sample. Molecular beacon nucleic acids aredescribed, for example, in Lizardi et al., U.S. Pat. No. 5,854,033;Nazarenko et al., U.S. Pat. No. 5,866,336, and Livak et al., U.S. Pat.No. 5,876,930.

[0836] Isolated 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 Polypeptides

[0837] In another aspect, the invention features, an isolated 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein,or fragment, e.g., a biologically active portion, for use as immunogensor antigens to raise or test (or more generally to bind) anti-21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593antibodies. 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein can be isolated from cells or tissue sources using standardprotein purification techniques. 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein or fragments thereof can beproduced by recombinant DNA techniques or synthesized chemically.

[0838] Polypeptides of the invention include those which arise as aresult of the existence of multiple genes, alternative transcriptionevents, alternative RNA splicing events, and alternative translationaland post-translational events. The polypeptide can be expressed insystems, e.g., cultured cells, which result in substantially the samepost-translational modifications present when the polypeptide isexpressed in a native cell, or in systems which result in the alterationor omission of post-translational modifications, e.g., glycosylation orcleavage, present in a native cell.

[0839] In a preferred embodiment, a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptide has one or more of thefollowing characteristics: it has the ability: (1) modulateATP-dependent phosphorylation of GMP, dGMP, or cGMP; (2) catalyze theformation of phosphoinositol-4,5-bisphosphate via the phosphorylation ofphosphatidylinositol-4-phosphate; (3) mediate the phosphoinositidesignaling cascade; (4) convert a substrate or target molecule to aproduct (e.g., transfer of a phosphate group to a substrate or targetmolecule, or conversion of ATP to ADP); (5) interact with and/orphosphate transfer to a second protein; (6) modulate intra- orintercellular signaling and/or gene transcription (e.g., either directlyor indirectly); (7) modulate the phosphorylation state of targetmolecules (e.g., a kinase or a phosphatase molecule) or thephosphorylation state of one or more proteins involved in cellulargrowth, metabolism, or differentiation, e.g., cardiac, epithelial, orneuronal cell growth or differentiation; (8) convert a substrate ortarget molecule to a product (e.g., transfer of a methyl group to orfrom the substrate or target molecule); (9) interact with and/or methyltransfer to a second target molecule e.g., a nucleic acid molecule(e.g., DNA or RNA), a small organic molecule (e.g., a hormone,neurotransmitter or a coenzyme) or a protein; 10) cleave a proteinprecursor to maturation; (11) catalyze protein degradation; (12)catalyze the formation of a covalent bond within or between an aminoacid residue (e.g., a serine or threonine residue) and a phosphatemoiety; (13) modulate the cAMP signal transduction pathway; (14)modulate a target cell's cAMP concentration; (15) modulatecAMP-dependent protein kinase activity, such as protein kinase A; (16)modulate a calpain protease response; (17) modulate metabolism andcatabolism of biochemical molecules, e.g., molecules necessary forenergy production or storage; (18) modulate betaine synthesis fromcholine; (19) modulate methionine synthesis from homocysteine; (20)modulate the activity of a bioactive peptide, (21) cleave a neprilysinsubstrate, e.g., enkephalin; (22) modulate membrane excitability, (23)influence the resting potential of membranes; (24) modulate acetyl-CoAligase activity; (25) promote activation of acetate; (26) promoteacetate utilization; (27) enhance uptake of acetate into fatty acids andbiochemical products made from fatty acids (e.g., lipids and hormonessuch as sterol hormones); (28) crosslink an extracellular matrixcomponent; (29) regulate bone resorption and/or metabolism; (30)regulate copper metabolism; (31) it has a molecular weight, e.g., adeduced molecular weight, preferably ignoring any contribution of posttranslational modifications, amino acid composition or other physicalcharacteristic of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide, e.g., a polypeptide of SEQ ID NO:2, 6,11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or112; (32) it has an overall sequence similarity of at least 60%,preferably at least 70%, more preferably at least 80, 90, or 95%, with apolypeptide of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55,58, 64, 67, 72, 89, 105, 108 or 112; (33) it is expressed in a multitudeof human tissues and cell lines (refer to section for each molecule ofthe invention); and (34) it has specific domains which are preferablyabout 70%, 80%, 90% or 95% identical to the identified amino acidresidues of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58,64, 67, 72, 89, 105, 108 or 112 (refer to section for each molecule ofthe invention for domain names and locations within amino acidsequence).

[0840] In a preferred embodiment the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein, or fragment thereof, differsfrom the corresponding sequence in SEQ ID NO:2, 6, 11, 19, 22, 25, 32,40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112. In oneembodiment it differs by at least one but by less than 15, 10 or 5 aminoacid residues. In another it differs from the corresponding sequence inSEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72,89, 105, 108 or 112 by at least one residue but less than 20%, 15%, 10%or 5% of the residues in it differ from the corresponding sequence inSEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72,89, 105, 108 or 112. (If this comparison requires alignment thesequences should be aligned for maximum homology. “Looped” out sequencesfrom deletions or insertions, or mismatches, are considereddifferences.) The differences are, preferably, differences or changes ata nonessential residue or a conservative substitution. In a preferredembodiment the differences are not in the identified or conserveddomain(s) within SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55,58, 64, 67, 72, 89, 105, 108 or 112. In another embodiment one or moredifferences are in the cidentified or conserved domain(s) within SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112.

[0841] Other embodiments include a protein that contains one or morechanges in amino acid sequence, e.g., a change in an amino acid residuewhich is not essential for activity. Such 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 proteins differ in amino acidsequence from SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55,58, 64, 67, 72, 89, 105, 108 or 112, yet retain biological activity.

[0842] In one embodiment, the protein includes an amino acid sequence atleast about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or morehomologous to SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55,58, 64, 67, 72, 89, 105, 108 or 112.

[0843] A 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein or fragment is provided which varies from the sequence ofSEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72,89, 105, 108 or 112 in regions defined by amino acids that are notwithin identified or conserved domains or regions by at least one but byless than 15, 10 or 5 amino acid residues in the protein or fragment butwhich does not differ from SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44,47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 in regions defined byamino acids that are within identified or conserved domains or regions.(If this comparison requires alignment the sequences should be alignedfor maximum homology. “Looped” out sequences from deletions orinsertions, or mismatches, are considered differences.) In someembodiments the difference is at a non-essential residue or is aconservative substitution, while in others the difference is at anessential residue or is a non-conservative substitution.

[0844] In one embodiment, a biologically active portion of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteinincludes an identified domain (refer to section for each molecule of theinvention). Moreover, other biologically active portions, in which otherregions of the protein are deleted, can be prepared by recombinanttechniques and evaluated for one or more of the functional activities ofa native 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein.

[0845] In a preferred embodiment, the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein has an amino acid sequenceshown in SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64,67, 72, 89, 105, 108 or 112. In other embodiments, the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein issufficiently or substantially identical to SEQ ID NO:2, 6, 11, 19, 22,25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112. In yetanother embodiment, the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein is sufficiently or substantially identicalto SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67,72, 89, 105, 108 or 112 and retains the functional activity of theprotein of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58,64, 67, 72, 89, 105, 108 or 112, as described in detail in thesubsections above.

[0846] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 Chimeric or Fusion Proteins

[0847] In another aspect, the invention provides 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 chimeric or fusionproteins. As used herein, a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 “chimeric protein” or “fusion protein”includes a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptide linked to a non-21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 polypeptide. A “non-21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide” refers to apolypeptide having an amino acid sequence corresponding to a proteinwhich is not substantially homologous to the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein, e.g., a protein whichis different from the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein and which is derived from the same or adifferent organism. The 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide of the fusion protein can correspond toall or a portion e.g., a fragment described herein of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 amino acidsequence. In a preferred embodiment, a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 fusion protein includes at least one(or two) biologically active portion of a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein. The non-21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide canbe fused to the N-terminus or C-terminus of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide.

[0848] The fusion protein can include a moiety which has a high affinityfor a ligand. For example, the fusion protein can be a GST-21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 fusion protein inwhich the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 sequences are fused to the C-terminus of the GST sequences. Suchfusion proteins can facilitate the purification of recombinant 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593.Alternatively, the fusion protein can be a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein containing aheterologous signal sequence at its N-terminus. In certain host cells(e.g., mammalian host cells), expression and/or secretion of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 can beincreased through use of a heterologous signal sequence.

[0849] Fusion proteins can include all or a part of a serum protein,e.g., a portion of an immunoglobulin (e.g., IgG, IgA, or IgE), e.g., anFc region and/or the hinge C1 and C2 sequences of an immunoglobulin orhuman serum albumin.

[0850] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 fusion proteins of the invention can be incorporated intopharmaceutical compositions and administered to a subject in vivo. The21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593fusion proteins can be used to affect the bioavailability of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 substrate.21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593fusion proteins can be useful therapeutically for the treatment ofdisorders caused by, for example, (i) aberrant modification or mutationof a gene encoding a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein; (ii) mis-regulation of the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene; and (iii)aberrant post-translational modification of a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein.

[0851] Moreover, the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593-fusion proteins of the invention can be used asimmunogens to produce anti-21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 antibodies in a subject, to purify 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 ligandsand in screening assays to identify molecules which inhibit theinteraction of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 with a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 substrate.

[0852] Expression vectors are commercially available that already encodea fusion moiety (e.g., a GST polypeptide). A 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-encoding nucleic acid can becloned into such an expression vector such that the fusion moiety islinked in-frame to the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein.

[0853] Variants of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 Proteins

[0854] In another aspect, the invention also features a variant of a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide, e.g., which functions as an agonist (mimetics) or as anantagonist. Variants of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 proteins can be generated by mutagenesis,e.g., discrete point mutation, the insertion or deletion of sequences orthe truncation of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein. An agonist of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins can retainsubstantially the same, or a subset, of the biological activities of thenaturally occurring form of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein. An antagonist of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein caninhibit one or more of the activities of the naturally occurring form ofthe 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein by, for example, competitively modulating a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593-mediated activity of a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein. Thus, specific biological effects can be elicited by treatmentwith a variant of limited function. Preferably, treatment of a subjectwith a variant having a subset of the biological activities of thenaturally occurring form of the protein has fewer side effects in asubject relative to treatment with the naturally occurring form of the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein.

[0855] Variants of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein can be identified by screening combinatoriallibraries of mutants, e.g., truncation mutants, of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein foragonist or antagonist activity.

[0856] Libraries of fragments e.g., N terminal, C terminal, or internalfragments, of a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein coding sequence can be used to generate avariegated population of fragments for screening and subsequentselection of variants of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein.

[0857] Variants in which a cysteine residues is added or deleted or inwhich a residue which is glycosylated is added or deleted areparticularly preferred.

[0858] Methods for screening gene products of combinatorial librariesmade by point mutations or truncation, and for screening cDNA librariesfor gene products having a selected property are known in the art.Recursive ensemble mutagenesis (REM), a new technique which enhances thefrequency of functional mutants in the libraries, can be used incombination with the screening assays to identify 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 variants (Arkin andYourvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815; Delgrave et al.(1993) Protein Engineering 6:327-331). Cell based assays can beexploited to analyze a variegated 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 library. For example, a library ofexpression vectors can be transfected into a cell line, e.g., a cellline, which ordinarily responds to 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 in a substrate-dependent manner. Thetransfected cells are then contacted with 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 and the effect of theexpression of the mutant on signaling by the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 substrate can be detected,e.g., by measuring either guanylate kinase, phophatidylinositol4-phosphate 5-kinase, kinase, transferase, aminopeptidase, adenylatecyclase, calpain protease, oxidoreductase, neprilysin protease, AMPbinding enzyme and lysyl oxidase activity, or other activity. PlasmidDNA can then be recovered from the cells which score for inhibition, oralternatively, potentiation of signaling by the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 substrate, and theindividual clones further characterized.

[0859] In another aspect, the invention features a method of making a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide, e.g., a peptide having a non-wild type activity, e.g., anantagonist, agonist, or super agonist of a naturally occurring 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide, e.g., a naturally occurring 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 polypeptide. The methodincludes altering the sequence of a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptide, e.g., altering thesequence, e.g., by substitution or deletion of one or more residues of anon-conserved region, a domain or residue disclosed herein, and testingthe altered polypeptide for the desired activity.

[0860] In another aspect, the invention features a method of making afragment or analog of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide a biological activity of a naturallyoccurring 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptide. The method includes altering the sequence, e.g., bysubstitution or deletion of one or more residues, of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide,e.g., altering the sequence of a non-conserved region, or a domain orresidue described herein, and testing the altered polypeptide for thedesired activity.

[0861] Anti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 Antibodies

[0862] In another aspect, the invention provides an anti-21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibody. Theterm “antibody” as used herein refers to an immunoglobulin molecule orimmunologically active portion thereof, i.e., an antigen-bindingportion. Examples of immunologically active portions of immunoglobulinmolecules include scFV and dcFV fragments, Fab and F(ab′)₂ fragmentswhich can be generated by treating the antibody with an enzyme such aspapain or pepsin, respectively.

[0863] The antibody can be a polyclonal, monoclonal, recombinant, e.g.,a chimeric or humanized, fully human, non-human, e.g., murine, or singlechain antibody. In a preferred embodiment it has effector function andcan fix complement. The antibody can be coupled to a toxin or imagingagent.

[0864] A full-length 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein or, antigenic peptide fragment of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 can beused as an immunogen or can be used to identify anti-21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibodies madewith other immunogens, e.g., cells, membrane preparations, and the like.The antigenic peptide of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 should include at least 8 amino acid residues of theamino acid sequence shown in SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44,47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 and encompasses anepitope of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593. Preferably, the antigenic peptide includes at least 10 amino acidresidues, more preferably at least 15 amino acid residues, even morepreferably at least 20 amino acid residues, and most preferably at least30 amino acid residues.

[0865] Fragments of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 which include hydrophilic regions of SEQ ID NO:2, 6,11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or112 can be used to make, e.g., used as immunogens or used tocharacterize the specificity of an antibody, antibodies againsthydrophilic regions of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein. Similarly, fragments of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 whichinclude hydrophobic regions of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40,44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 can be used to makean antibody against a hydrophobic region of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein; fragments of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593which include residues within extra cellular domain(s) of SEQ ID NO:2,6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108or 112 can be used to make an antibody against an extracellular ornon-cytoplasmic region of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein; fragments of 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 which include residueswithin intracellular regions of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40,44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 can be used to makean antibody against an intracellular region of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein; a fragment of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593which include residues within identified or conserved domains of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112 can be used to make an antibody against the identifiedor conserved domain of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein.

[0866] Antibodies reactive with, or specific or selective for, any ofthese regions, or other regions or domains described herein areprovided.

[0867] Preferred epitopes encompassed by the antigenic peptide areregions of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 located on the surface of the protein, e.g., hydrophilic regions, aswell as regions with high antigenicity. For example, an Emini surfaceprobability analysis of the human 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein sequence can be used toindicate the regions that have a particularly high probability of beinglocalized to the surface of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein and are thus likely to constitutesurface residues useful for targeting antibody production.

[0868] In a preferred embodiment the antibody can bind to theextracellular portion of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein, e.g., it can bind to a whole cellwhich expresses the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein. In another embodiment, the antibody bindsan intracellular portion of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein.

[0869] In a preferred embodiment the antibody binds an epitope on anydomain or region on 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 proteins described herein.

[0870] Additionally, chimeric, humanized, and completely humanantibodies are also within the scope of the invention. Chimeric,humanized, but most preferably, completely human antibodies aredesirable for applications which include repeated administration, e.g.,therapeutic treatment of human patients, and some diagnosticapplications.

[0871] Chimeric and humanized monoclonal antibodies, comprising bothhuman and non-human portions, can be made using standard recombinant DNAtechniques. Such chimeric and humanized monoclonal antibodies can beproduced by recombinant DNA techniques known in the art, for exampleusing methods described in Robinson et al. International Application No.PCT/US86/02269; Akira, et al. European Patent Application 184,187;Taniguchi, European Patent Application 171,496; Morrison et al. EuropeanPatent Application 173,494; Neuberger et al. PCT InternationalPublication No. WO 86/01533; Cabilly et al. U.S. Pat. No. 4,816,567;Cabilly et al. European Patent Application 125,023; Better et al. (1988)Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. Sci. USA84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et al.(1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al. (1987)Canc. Res. 47:999-1005; Wood et al. (1985) Nature 314:446-449; and Shawet al. (1988) J. Natl. Cancer Inst. 80:1553-1559).

[0872] A humanized or complementarity determining region (CDR)-graftedantibody will have at least one or two, but generally all threerecipient CDR's (of heavy and or light immuoglobulin chains) replacedwith a donor CDR. The antibody may be replaced with at least a portionof a non-human CDR or only some of the CDR's may be replaced withnon-human CDR's. It is only necessary to replace the number of CDR'srequired for binding of the humanized antibody to a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 or a fragment thereof.Preferably, the donor will be a rodent antibody, e.g., a rat or mouseantibody, and the recipient will be a human framework or a humanconsensus framework. Typically, the immunoglobulin providing the CDR'sis called the “donor” and the immunoglobulin providing the framework iscalled the “acceptor.” In one embodiment, the donor immunoglobulin is anon-human (e.g., rodent). The acceptor framework is anaturally-occurring (e.g., a human) framework or a consensus framework,or a sequence about 85% or higher, preferably 90%, 95%, 99% or higheridentical thereto.

[0873] As used herein, the term “consensus sequence” refers to thesequence formed from the most frequently occurring amino acids (ornucleotides) in a family of related sequences (See e.g., Winnaker,(1987) From Genes to Clones (Verlagsgesellschaft, Weinheim, Germany). Ina family of proteins, each position in the consensus sequence isoccupied by the amino acid occurring most frequently at that position inthe family. If two amino acids occur equally frequently, either can beincluded in the consensus sequence. A “consensus framework” refers tothe framework region in the consensus immunoglobulin sequence.

[0874] An antibody can be humanized by methods known in the art.Humanized antibodies can be generated by replacing sequences of the Fvvariable region which are not directly involved in antigen binding withequivalent sequences from human Fv variable regions. General methods forgenerating humanized antibodies are provided by Morrison (1985) Science229:1202-1207, by Oi et al. (1986) BioTechniques 4:214, and by Queen etal. U.S. Pat. Nos. 5,585,089, 5,693,761 and 5,693,762, the contents ofall of which are hereby incorporated by reference. Those methods includeisolating, manipulating, and expressing the nucleic acid sequences thatencode all or part of immunoglobulin Fv variable regions from at leastone of a heavy or light chain. Sources of such nucleic acid are wellknown to those skilled in the art and, for example, may be obtained froma hybridoma producing an antibody against a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 polypeptide or fragmentthereof. The recombinant DNA encoding the humanized antibody, orfragment thereof, can then be cloned into an appropriate expressionvector.

[0875] Humanized or CDR-grafted antibodies can be produced byCDR-grafting or CDR substitution, wherein one, two, or all CDR's of animmunoglobulin chain can be replaced. See e.g., U.S. Pat. No. 5,225,539;Jones et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) Science239:1534; Beidler et al. (1988) J. Immunol. 141:4053-4060; Winter U.S.Pat. No. 5,225,539, the contents of all of which are hereby expresslyincorporated by reference. Winter describes a CDR-grafting method whichmay be used to prepare the humanized antibodies of the present invention(UK Patent Application GB 2188638A, filed on Mar. 26, 1987; Winter U.S.Pat. No. 5,225,539), the contents of which is expressly incorporated byreference.

[0876] Also within the scope of the invention are humanized antibodiesin which specific amino acids have been substituted, deleted or added.Preferred humanized antibodies have amino acid substitutions in theframework region, such as to improve binding to the antigen. Forexample, a humanized antibody will have framework residues identical tothe donor framework residue or to another amino acid other than therecipient framework residue. To generate such antibodies, a selected,small number of acceptor framework residues of the humanizedimmunoglobulin chain can be replaced by the corresponding donor aminoacids. Preferred locations of the substitutions include amino acidresidues adjacent to the CDR, or which are capable of interacting with aCDR (see e.g., U.S. Pat. No. 5,585,089). Criteria for selecting aminoacids from the donor are described in U.S. Pat. No. 5,585,089, e.g.,columns 12-16 of U.S. Pat. No. 5,585,089, the e.g., columns 12-16 ofU.S. Pat. No. 5,585,089, the contents of which are hereby incorporatedby reference. Other techniques for humanizing antibodies are describedin Padlan et al. EP 519596 A1, published on Dec. 23, 1992.

[0877] Completely human antibodies are particularly desirable fortherapeutic treatment of human patients. Such antibodies can be producedusing transgenic mice that are incapable of expressing endogenousimmunoglobulin heavy and light chains genes, but which can express humanheavy and light chain genes. See, for example, Lonberg and Huszar (1995)Int. Rev. Immunol. 13:65-93); and U.S. Pat. Nos. 5,625,126; 5,633,425;5,569,825; 5,661,016; and 5,545,806. In addition, companies such asAbgenix, Inc. (Fremont, Calif.) and Medarex, Inc. (Princeton, N.J.), canbe engaged to provide human antibodies directed against a selectedantigen using technology similar to that described above.

[0878] Completely human antibodies that recognize a selected epitope canbe generated using a technique referred to as “guided selection.” Inthis approach a selected non-human monoclonal antibody, e.g., a murineantibody, is used to guide the selection of a completely human antibodyrecognizing the same epitope. This technology is described by Jespers etal. (1994) Bio/Technology 12:899-903).

[0879] The anti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 antibody can be a single chain antibody. A single-chainantibody (scFV) can be engineered as described in, for example, Colcheret al. (1999) Ann. NY Acad. Sci. 880:263-80; and Reiter (1996) Clin.Cancer Res. 2:245-52. The single chain antibody can be dimerized ormultimerized to generate multivalent antibodies having specificities fordifferent epitopes of the same target 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein.

[0880] In a preferred embodiment, the antibody has reduced or no abilityto bind an Fc receptor. For example, it is an isotype or subtype,fragment or other mutant, which does not support binding to an Fcreceptor, e.g., it has a mutagenized or deleted Fc receptor bindingregion.

[0881] An antibody (or fragment thereof) may be conjugated to atherapeutic moiety such as a cytotoxin, a therapeutic agent or aradioactive ion. A cytotoxin or cytotoxic agent includes any agent thatis detrimental to cells. Examples include taxol, cytochalasin B,gramicidin D, ethidium bromide, emetine, mitomycin, etoposide,tenoposide, vincristine, vinblastine, colchicin, doxorubicin,daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin,actinomycin D, 1-dehydrotestosterone, glucocorticoids, procaine,tetracaine, lidocaine, propranolol, puromycin, maytansinoids, e.g.,maytansinol (see U.S. Pat. No. 5,208,020), CC-1065 (see U.S. Pat. Nos.5,475,092, 5,585,499, 5,846,545) and analogs or homologs thereof.Therapeutic agents include, but are not limited to, antimetabolites(e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine,5-fluorouracil decarbazine), alkylating agents (e.g., mechlorethamine,thioepa chlorambucil, CC-1065, melphalan, carmustine (BSNU) andlomustine (CCNU), cyclothosphamide, busulfan, dibromomannitol,streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) (DDP)cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) anddoxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin),bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents(e.g., vincristine, vinblastine, taxol and maytansinoids).

[0882] Radioactive ions include, but are not limited to iodine, yttrium,lutecium and praseodymium.

[0883] The conjugates of the invention can be used for modifying a givenbiological response, the therapeutic moiety is not to be construed aslimited to classical chemical therapeutic agents. For example, thetherapeutic moiety may be a protein or polypeptide possessing a desiredbiological activity. Such proteins may include, for example, a toxinsuch as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; aprotein such as tumor necrosis factor, α-interferon, β-interferon, nervegrowth factor, platelet derived growth factor, tissue plasminogenactivator; or, biological response modifiers such as, for example,lymphokines, interleukin-1 (“IL-1”), interleukin-2 (“IL-2”),interleukin-6 (“IL-6”), granulocyte macrophase colony stimulating factor(“GM-CSF”), granulocyte colony stimulating factor (“G-CSF”), or othergrowth factors.

[0884] Alternatively, an antibody can be conjugated to a second antibodyto form an antibody heteroconjugate as described by Segal in U.S. Pat.No. 4,676,980.

[0885] An anti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 antibody (e.g., monoclonal antibody) can be used to isolate21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 bystandard techniques, such as affinity chromatography orimmunoprecipitation. Moreover, an anti-21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 antibody can be used to detect 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein(e.g., in a cellular lysate or cell supernatant) in order to evaluatethe abundance and pattern of expression of the protein. Anti-21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibodiescan be used diagnostically to monitor protein levels in tissue as partof a clinical testing procedure, e.g., to determine the efficacy of agiven treatment regimen. Detection can be facilitated by coupling (i.e.,physically linking) the antibody to a detectable substance (i.e.,antibody labelling). Examples of detectable substances include variousenzymes, prosthetic groups, fluorescent materials, luminescentmaterials, bioluminescent materials, and radioactive materials. Examplesof suitable enzymes include horseradish peroxidase, alkalinephosphatase, β-galactosidase, or acetylcholinesterase; examples ofsuitable prosthetic group complexes include streptavidin/biotin andavidin/biotin; examples of suitable fluorescent materials includeumbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine,dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; anexample of a luminescent material includes luminol; examples ofbioluminescent materials include luciferase, luciferin, and aequorin,and examples of suitable radioactive material include ¹²⁵I, ¹³¹I, ³⁵S or³H.

[0886] In preferred embodiments, an antibody can be made by immunizingwith a purified 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 antigen, or a fragment thereof, e.g., a fragment describedherein, a membrane associated antigen, tissues, e.g., crude tissuepreparations, whole cells, preferably living cells, lysed cells, or cellfractions, e.g., membrane fractions.

[0887] Antibodies which bind only a native 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein, only denatured orotherwise non-native 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein, or which bind both, are within theinvention. Antibodies with linear or conformational epitopes are withinthe invention. Conformational epitopes sometimes can be identified byidentifying antibodies which bind to native but not denatured 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein.

[0888] Recombinant Expression Vectors, Host Cells and GeneticallyEngineered Cells

[0889] In another aspect, the invention includes, vectors, preferablyexpression vectors, containing a nucleic acid encoding a polypeptidedescribed herein. As used herein, the term “vector” refers to a nucleicacid molecule capable of transporting another nucleic acid to which ithas been linked and can include a plasmid, cosmid or viral vector. Thevector can be capable of autonomous replication or it can integrate intoa host DNA. Viral vectors include, e.g., replication defectiveretroviruses, adenoviruses and adeno-associated viruses.

[0890] A vector can include a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 nucleic acid in a form suitable forexpression of the nucleic acid in a host cell.

[0891] Preferably the recombinant expression vector includes one or moreregulatory sequences operatively linked to the nucleic acid sequence tobe expressed. The term “regulatory sequence” includes promoters,enhancers and other expression control elements (e.g., polyadenylationsignals). Regulatory sequences include those which direct constitutiveexpression of a nucleotide sequence, as well as tissue-specificregulatory and/or inducible sequences. The design of the expressionvector can depend on such factors as the choice of the host cell to betransformed, the level of expression of protein desired, and the like.The expression vectors of the invention can be introduced into hostcells to thereby produce proteins or polypeptides, including fusionproteins or polypeptides, encoded by nucleic acids as described herein(e.g., 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 proteins, mutant forms of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 proteins, fusion proteins, and the like).

[0892] The recombinant expression vectors of the invention can bedesigned for expression of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 proteins in prokaryotic or eukaryotic cells.For example, polypeptides of the invention can be expressed in E. coli,insect cells (e.g., using baculovirus expression vectors), yeast cellsor mammalian cells. Suitable host cells are discussed further inGoeddel, (1990) Gene Expression Technology: Methods in Enzymology 185,Academic Press, San Diego, Calif. Alternatively, the recombinantexpression vector can be transcribed and translated in vitro, forexample using T7 promoter regulatory sequences and T7 polymerase.

[0893] Expression of proteins in prokaryotes is most often carried outin E. coli with vectors containing constitutive or inducible promotersdirecting the expression of either fusion or non-fusion proteins. Fusionvectors add a number of amino acids to a protein encoded therein,usually to the amino terminus of the recombinant protein. Such fusionvectors typically serve three purposes: 1) to increase expression ofrecombinant protein; 2) to increase the solubility of the recombinantprotein; and 3) to aid in the purification of the recombinant protein byacting as a ligand in affinity purification. Often, a proteolyticcleavage site is introduced at the junction of the fusion moiety and therecombinant protein to enable separation of the recombinant protein fromthe fusion moiety subsequent to purification of the fusion protein. Suchenzymes, and their cognate recognition sequences, include Factor Xa,thrombin and enterokinase. Typical fusion expression vectors includepGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67:31-40),pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia,Piscataway, N.J.) which fuse glutathione S-transferase (GST), maltose Ebinding protein, or protein A, respectively, to the target recombinantprotein.

[0894] Purified fusion proteins can be used in 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 activity assays, (e.g.,direct assays or competitive assays described in detail below), or togenerate antibodies specific or selective for 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 proteins. In a preferredembodiment, a fusion protein-expressed in a retroviral expression vectorof the present invention can be used to infect bone marrow cells whichare subsequently transplanted into irradiated recipients. The pathologyof the subject recipient is then examined after sufficient time haspassed (e.g., six weeks).

[0895] To maximize recombinant protein expression in E. coli is toexpress the protein in a host bacteria with an impaired capacity toproteolytically cleave the recombinant protein (Gottesman (1990) GeneExpression Technology: Methods in Enzymology 185, Academic Press, SanDiego, Calif. 119-128). Another strategy is to alter the nucleic acidsequence of the nucleic acid to be inserted into an expression vector sothat the individual codons for each amino acid are those preferentiallyutilized in E. coli (Wada et al., (1992) Nucleic Acids Res.20:2111-2118). Such alteration of nucleic acid sequences of theinvention can be carried out by standard DNA synthesis techniques.

[0896] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 expression vector can be a yeast expression vector, a vector forexpression in insect cells, e.g., a baculovirus expression vector or avector suitable for expression in mammalian cells.

[0897] When used in mammalian cells, the expression vector's controlfunctions are often provided by viral regulatory elements. For example,commonly used promoters are derived from polyoma, Adenovirus 2,cytomegalovirus and Simian Virus 40.

[0898] In another embodiment, the recombinant mammalian expressionvector is capable of directing expression of the nucleic acidpreferentially in a particular cell type (e.g., tissue-specificregulatory elements are used to express the nucleic acid). Non-limitingexamples of suitable tissue-specific promoters include the albuminpromoter (liver-specific; Pinkert et al. (1987) Genes Dev. 1:268-277),lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol.43:235-275), in particular promoters of T cell receptors (Winoto andBaltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al.(1983) Cell 33:729-740; Queen and Baltimore (1983) Cell 33:741-748),neuron-specific promoters (e.g., the neurofilament promoter; Byrne andRuddle (1989) Proc. Natl. Acad. Sci. USA 86:5473-5477),pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916),and mammary gland-specific promoters (e.g., milk whey promoter; U.S.Pat. No. 4,873,316 and European Application Publication No. 264,166).Developmentally-regulated promoters are also encompassed, for example,the murine hox promoters (Kessel and Gruss (1990) Science 249:374-379)and the α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev.3:537-546).

[0899] The invention further provides a recombinant expression vectorcomprising a DNA molecule of the invention cloned into the expressionvector in an antisense orientation. Regulatory sequences (e.g., viralpromoters and/or enhancers) operatively linked to a nucleic acid clonedin the antisense orientation can be chosen which direct theconstitutive, tissue specific or cell type specific expression ofantisense RNA in a variety of cell types. The antisense expressionvector can be in the form of a recombinant plasmid, phagemid orattenuated virus. For a discussion of the regulation of gene expressionusing antisense genes see Weintraub et al., (1986) Reviews—Trends inGenetics 1:1.

[0900] Another aspect the invention provides a host cell which includesa nucleic acid molecule described herein, e.g., a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acid moleculewithin a recombinant expression vector or a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 nucleic acid moleculecontaining sequences which allow it to homologously recombine into aspecific site of the host cell's genome. The terms “host cell” and“recombinant host cell” are used interchangeably herein. Such termsrefer not only to the particular subject cell but to the progeny orpotential progeny of such a cell. Because certain modifications canoccur in succeeding generations due to either mutation or environmentalinfluences, such progeny may not, in fact, be identical to the parentcell, but are still included within the scope of the term as usedherein.

[0901] A host cell can be any prokaryotic or eukaryotic cell. Forexample, a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein can be expressed in bacterial cells such as E. coli, insectcells, yeast or mammalian cells (such as Chinese hamster ovary (CHO)cells or CV-1 origin, SV-40 (COS) cells). Other suitable host cells areknown to those skilled in the art.

[0902] Vector DNA can be introduced into host cells via conventionaltransformation or transfection techniques. As used herein, the terms“transformation” and “transfection” are intended to refer to a varietyof art-recognized techniques for introducing foreign nucleic acid (e.g.,DNA) into a host cell, including calcium phosphate or calcium chlorideco-precipitation, DEAE-dextran-mediated transfection, lipofection, orelectroporation.

[0903] A host cell of the invention can be used to produce (i.e.,express) a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein. Accordingly, the invention further provides methods forproducing a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein using the host cells of the invention. In one embodiment,the method includes culturing the host cell of the invention (into whicha recombinant expression vector encoding a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein has been introduced) ina suitable medium such that a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein is produced. In another embodiment,the method further includes isolating a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein from the medium or thehost cell.

[0904] In another aspect, the invention features, a cell or purifiedpreparation of cells which include a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 transgene, or which otherwisemisexpress 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593. The cell preparation can consist of human or non-human cells, e.g.,rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. Inpreferred embodiments, the cell or cells include a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 transgene, e.g., aheterologous form of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593, e.g., a gene derived from humans (in the case of anon-human cell). The 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 transgene can be misexpressed, e.g., overexpressedor underexpressed. In other preferred embodiments, the cell or cellsinclude a gene which misexpresses an endogenous 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593, e.g., a gene theexpression of which is disrupted, e.g., a knockout. Such cells can serveas a model for studying disorders which are related to mutated ormisexpressed 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 alleles or for use in drug screening.

[0905] In another aspect, the invention features, a human cell, e.g., ahematopoietic stem cell, transformed with nucleic acid which encodes asubject 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptide.

[0906] Also provided are cells, preferably human cells, e.g., humanhematopoietic or fibroblast cells, in which an endogenous 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 is under thecontrol of a regulatory sequence that does not normally control theexpression of the endogenous 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 gene. The expression characteristics of anendogenous gene within a cell, e.g., a cell line or microorganism, canbe modified by inserting a heterologous DNA regulatory element into thegenome of the cell such that the inserted regulatory element is operablylinked to the endogenous 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 gene. For example, an endogenous 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene which is“transcriptionally silent,” e.g., not normally expressed, or expressedonly at very low levels, can be activated by inserting a regulatoryelement which is capable of promoting the expression of a normallyexpressed gene product in that cell. Techniques such as targetedhomologous recombinations, can be used to insert the heterologous DNA asdescribed in, e.g., Chappel, U.S. Pat. No. 5,272,071; WO 91/06667,published in May 16, 1991.

[0907] Transgenic Animals

[0908] The invention provides non-human transgenic animals. Such animalsare useful for studying the function and/or activity of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, ml 983, 38555 or 593 protein and foridentifying and/or evaluating modulators of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 activity. As used herein, a“transgenic animal” is a non-human animal, preferably a mammal, morepreferably a rodent such as a rat or mouse, in which one or more of thecells of the animal includes a transgene. Other examples of transgenicanimals include non-human primates, sheep, dogs, cows, goats, chickens,amphibians, and the like. A transgene is exogenous DNA or arearrangement, e.g., a deletion of endogenous chromosomal DNA, whichpreferably is integrated into or occurs in the genome of the cells of atransgenic animal. A transgene can direct the expression of an encodedgene product in one or more cell types or tissues of the transgenicanimal, other transgenes, e.g., a knockout, reduce expression. Thus, atransgenic animal can be one in which an endogenous 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 gene has been alteredby, e.g., by homologous recombination between the endogenous gene and anexogenous DNA molecule introduced into a cell of the animal, e.g., anembryonic cell of the animal, prior to development of the animal.

[0909] Intronic sequences and polyadenylation signals can also beincluded in the transgene to increase the efficiency of expression ofthe transgene. A tissue-specific regulatory sequence(s) can be operablylinked to a transgene of the invention to direct expression of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein toparticular cells. A transgenic founder animal can be identified basedupon the presence of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 transgene in its genome and/or expression of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h 1983, m1983, 38555 or 593 mRNA intissues or cells of the animals. A transgenic founder animal can then beused to breed additional animals carrying the transgene. Moreover,transgenic animals carrying a transgene encoding a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h 1983, m1983, 38555 or 593 protein can further bebred to other transgenic animals carrying other transgenes.

[0910] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 proteins or polypeptides can be expressed in transgenic animals orplants, e.g., a nucleic acid encoding the protein or polypeptide can beintroduced into the genome of an animal. In preferred embodiments thenucleic acid is placed under the control of a tissue specific promoter,e.g., a milk or egg specific promoter, and recovered from the milk oreggs produced by the animal. Suitable animals are mice, pigs, cows,goats, and sheep.

[0911] The invention also includes a population of cells from atransgenic animal, as discussed, e.g., below.

[0912] Uses

[0913] The nucleic acid molecules, proteins, protein homologs, andantibodies described herein can be used in one or more of the followingmethods: a) screening assays; b) predictive medicine (e.g., diagnosticassays, prognostic assays, monitoring clinical trials, andpharmacogenetics); and c) methods of treatment (e.g., therapeutic andprophylactic). The isolated nucleic acid molecules of the invention canbe used, for example, to express a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein (e.g., via a recombinantexpression vector in a host cell in gene therapy applications), todetect a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 mRNA (e.g., in a biological sample) or a genetic alteration in a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene, and to modulate 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 activity, as described further below. The 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteinscan be used to treat disorders characterized by insufficient orexcessive production of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 substrate or production of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 inhibitors. Inaddition, the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 proteins can be used to screen for naturally occurring21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593substrates, to screen for drugs or compounds which modulate 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity,as well as to treat disorders characterized by insufficient or excessiveproduction of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or production of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein forms which have decreased,aberrant or unwanted activity compared to 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 wild type protein (e.g.,aberrant or deficient guanylate kinase activity, phophatidylinositol4-phosphate 5-kinase activity, kinase activity, transferase activity,aminopeptidase activity, adenylate cyclase activity, calpain proteaseactivity, oxidoreductase activity, neprilysin protease activity, AMPbinding enzyme activity and lysyl oxidase activity, or other activity).Moreover, the anti-21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 antibodies of the invention can be used to detectand isolate 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 proteins, regulate the bioavailability of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 proteins, and modulate 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity.

[0914] A method of evaluating a compound for the ability to interactwith, e.g., bind, a subject 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 polypeptide is provided. The method includes:contacting the compound with the subject 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 polypeptide; and evaluatingability of the compound to interact with, e.g., to bind or form acomplex with the subject 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide. This method can be performed in vitro,e.g., in a cell free system, or in vivo, e.g., in a two-hybridinteraction trap assay. This method can be used to identify naturallyoccurring molecules which interact with subject 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide. It can alsobe used to find natural or synthetic inhibitors of subject 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide.Screening methods are discussed in more detail below.

[0915] Screening Assays

[0916] The invention provides methods (also referred to herein as“screening assays”) for identifying modulators, i.e., candidate or testcompounds or agents (e.g., proteins, peptides, peptidomimetics,peptoids, small molecules or other drugs) which bind to 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins, have astimulatory or inhibitory effect on, for example, 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 expression or 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity,or have a stimulatory or inhibitory effect on, for example, theexpression or activity of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 substrate. Compounds thus identified can beused to modulate the activity of target gene products (e.g., 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 genes) ina therapeutic protocol, to elaborate the biological function of thetarget gene product, or to identify compounds that disrupt normal targetgene interactions.

[0917] In one embodiment, the invention provides assays for screeningcandidate or test compounds which are substrates of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein orpolypeptide or a biologically active portion thereof. In anotherembodiment, the invention provides assays for screening candidate ortest compounds which bind to or modulate the activity of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein orpolypeptide or a biologically active portion thereof.

[0918] The test compounds of the present invention can be obtained usingany of the numerous approaches in combinatorial library methods known inthe art, including: biological libraries; peptoid libraries (librariesof molecules having the functionalities of peptides, but with a novel,non-peptide backbone which are resistant to enzymatic degradation butwhich nevertheless remain bioactive; see, e.g., Zuckermann et al. (1994)J. Med. Chem. 37:2678-85); spatially addressable parallel solid phase orsolution phase libraries; synthetic library methods requiringdeconvolution; the ‘one-bead one-compound’ library method; and syntheticlibrary methods using affinity chromatography selection. The biologicallibrary and peptoid library approaches are limited to peptide libraries,while the other four approaches are applicable to peptide, non-peptideoligomer or small molecule libraries of compounds (Lam (1997) AnticancerDrug Des. 12:145).

[0919] Examples of methods for the synthesis of molecular libraries canbe found in the art, for example in: DeWitt et al. (1993) Proc. Natl.Acad. Sci. U.S.A. 90:6909-13; Erb et al. (1994) Proc. Natl. Acad. Sci.USA 91:11422-426; Zuckermann et al. (1994). J. Med. Chem. 37:2678-85;Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem.Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl.33:2061; and in Gallop et al. (1994) J. Med. Chem. 37:1233-51.

[0920] Libraries of compounds can be presented in solution (e.g.,Houghten (1992) Biotechniques 13:412-421), or on beads (Lam (1991)Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria(Ladner, U.S. Pat. No. 5,223,409), spores (Ladner USP '409), plasmids(Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage(Scott and Smith (1990) Science 249:386-390; Devlin (1990) Science249:404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382;Felici (1991) J. Mol. Biol. 222:301-310; Ladner supra.).

[0921] In one embodiment, an assay is a cell-based assay in which a cellwhich expresses a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or biologically active portion thereof is contactedwith a test compound, and the ability of the test compound to modulate21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593activity is determined. Determining the ability of the test compound tomodulate 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity can be accomplished by monitoring, for example, guanylatekinase activity, phophatidylinositol 4-phosphate 5-kinase activity,kinase activity, transferase activity, aminopeptidase activity,adenylate cyclase activity, calpain protease activity, oxidoreductaseactivity, neprilysin protease activity, AMP binding enzyme activity andlysyl oxidase activity, or other activity. The cell, for example, can beof mammalian origin, e.g., human.

[0922] The ability of the test compound to modulate 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 binding to a compound,e.g., a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 substrate, or to bind to 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 can also be evaluated. This can beaccomplished, for example, by coupling the compound, e.g., thesubstrate, with a radioisotope or enzymatic label such that binding ofthe compound, e.g., the substrate, to 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 can be determined by detecting thelabeled compound, e.g., substrate, in a complex. Alternatively, 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 566381, 18610, 33217, 21967, h1983, m1983, 38555 or 593 could becoupled with a radioisotope or enzymatic label to monitor the ability ofa test compound to modulate 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 binding to a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 substrate in a complex. Forexample, compounds (e.g., 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 substrates) can be labeled with ¹²⁵I, ¹⁴C,³⁵S or ³H, either directly or indirectly, and the radioisotope detectedby direct counting of radioemmission or by scintillation counting.Alternatively, compounds can be enzymatically labeled with, for example,horseradish peroxidase, alkaline phosphatase, or luciferase, and theenzymatic label detected by determination of conversion of anappropriate substrate to product.

[0923] The ability of a compound (e.g., a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 substrate) to interact with21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593with or without the labeling of any of the interactants can beevaluated. For example, a microphysiometer can be used to detect theinteraction of a compound with 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 without the labeling of either the compoundor the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593. McConnell et al. (1992) Science 257:1906-1912. As used herein, a“microphysiometer” (e.g., Cytosensor) is an analytical instrument thatmeasures the rate at which a cell acidifies its environment using alight-addressable potentiometric sensor (LAPS). Changes in thisacidification rate can be used as an indicator of the interactionbetween a compound and 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593.

[0924] In yet another embodiment, a cell-free assay is provided in whicha 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein or biologically active portion thereof is contacted with a testcompound and the ability of the test compound to bind to the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein orbiologically active portion thereof is evaluated. Preferred biologicallyactive portions of the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 proteins to be used in assays of the presentinvention include fragments which participate in interactions withnon-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593molecules, e.g., fragments with high surface probability scores.

[0925] Soluble and/or membrane-bound forms of isolated proteins (e.g.,21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593proteins or biologically active portions thereof) can be used in thecell-free assays of the invention. When membrane-bound forms of theprotein are used, it may be desirable to utilize a solubilizing agent.Examples of such solubilizing agents include non-ionic detergents suchas n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside,octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X-100,Triton® X-114, Thesit®, Isotridecypoly(ethylene glycol ether)_(n),3-[(3-cholamidopropyl)dimethylamminio]-1-propane sulfonate (CHAPS),3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-propane sulfonate(CHAPSO), or N-dodecyl=N,N-dimethyl-3-ammonio-1-propane sulfonate.

[0926] Cell-free assays involve preparing a reaction mixture of thetarget gene protein and the test compound under conditions and for atime sufficient to allow the two components to interact and bind, thusforming a complex that can be removed and/or detected.

[0927] The interaction between two molecules can also be detected, e.g.,using fluorescence energy transfer (FET) (see, for example, Lakowicz etal., U.S. Pat. No. 5,631,169; Stavrianopoulos, et al., U.S. Pat. No.4,868,103). A fluorophore label on the first, ‘donor’ molecule isselected such that its emitted fluorescent energy will be absorbed by afluorescent label on a second, ‘acceptor’ molecule, which in turn isable to fluoresce due to the absorbed energy. Alternately, the ‘donor’protein molecule can simply utilize the natural fluorescent energy oftryptophan residues. Labels are chosen that emit different wavelengthsof light, such that the ‘acceptor’ molecule label can be differentiatedfrom that of the ‘donor’. Since the efficiency of energy transferbetween the labels is related to the distance separating the molecules,the spatial relationship between the molecules can be assessed. In asituation in which binding occurs between the molecules, the fluorescentemission of the ‘acceptor’ molecule label in the assay should bemaximal. An FET binding event can be conveniently measured throughstandard fluorometric detection means well known in the art (e.g., usinga fluorimeter).

[0928] In another embodiment, determining the ability of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein tobind to a target molecule can be accomplished using real-timeBiomolecular Interaction Analysis (BIA) (see, e.g., Sjolander andUrbaniczky (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr.Opin. Struct. Biol. 5:699-705). “Surface plasmon resonance” or “BIA”detects biospecific interactions in real time, without labeling any ofthe interactants (e.g., BIAcore). Changes in the mass at the bindingsurface (indicative of a binding event) result in alterations of therefractive index of light near the surface (the optical phenomenon ofsurface plasmon resonance (SPR8)), resulting in a detectable signalwhich can be used as an indication of real-time reactions betweenbiological molecules.

[0929] In one embodiment, the target gene product or the test substanceis anchored onto a solid phase. The target gene product/test compoundcomplexes anchored on the solid phase can be detected at the end of thereaction. Preferably, the target gene product can be anchored onto asolid surface, and the test compound, (which is not anchored), can belabeled, either directly or indirectly, with detectable labels discussedherein.

[0930] It may be desirable to immobilize either 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593, an anti-21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibody or itstarget molecule to facilitate separation of complexed from uncomplexedforms of one or both of the proteins, as well as to accommodateautomation of the assay. Binding of a test compound to a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein, orinteraction of a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein with a target molecule in the presence and absenceof a candidate compound, can be accomplished in any vessel suitable forcontaining the reactants. Examples of such vessels include microtiterplates, test tubes, and micro-centrifuge tubes. In one embodiment, afusion protein can be provided which adds a domain that allows one orboth of the proteins to be bound to a matrix. For example,glutathione-S-transferase/21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 fusion proteins orglutathione-S-transferase/target fusion proteins can be adsorbed ontoglutathione sepharose beads (Sigma Chemical, St. Louis, Mo.) orglutathione derivatized microtiter plates, which are then combined withthe test compound or the test compound and either the non-adsorbedtarget protein or 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein, and the mixture incubated under conditionsconducive to complex formation (e.g., at physiological conditions forsalt and pH).

[0931] Following incubation, the beads or microtiter plate wells arewashed to remove any unbound components, the matrix immobilized in thecase of beads, complex determined either directly or indirectly, forexample, as described above. Alternatively, the complexes can bedissociated from the matrix, and the level of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 binding or activity determinedusing standard techniques.

[0932] Other techniques for immobilizing either a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein or a targetmolecule on matrices include using conjugation of biotin andstreptavidin. Biotinylated 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein or target molecules can be preparedfrom biotin-NHS (N-hydroxy-succinimide) using techniques known in theart (e.g., biotinylation kit, Pierce Chemicals, Rockford, Ill.), andimmobilized in the wells of streptavidin-coated 96 well plates (PierceChemical).

[0933] In order to conduct the assay, the non-immobilized component isadded to the coated surface containing the anchored component. After thereaction is complete, unreacted components are removed (e.g., bywashing) under conditions such that any complexes formed will remainimmobilized on the solid surface. The detection of complexes anchored onthe solid surface can be accomplished in a number of ways. Where thepreviously non-immobilized component is pre-labeled, the detection oflabel immobilized on the surface indicates that complexes were formed.Where the previously non-immobilized component is not pre-labeled, anindirect label can be used to detect complexes anchored on the surface;e.g., using a labeled antibody specific or selective for the immobilizedcomponent (the antibody, in turn, can be directly labeled or indirectlylabeled with, e.g., a labeled anti-Ig antibody).

[0934] In one embodiment, this assay is performed utilizing antibodiesreactive with 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or target molecules but which do not interfere withbinding of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein to its target molecule. Such antibodies can bederivatized to the wells of the plate, and unbound target or 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteintrapped in the wells by antibody conjugation. Methods for detecting suchcomplexes, in addition to those described above for the GST-immobilizedcomplexes, include immunodetection of complexes using antibodiesreactive with the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or target molecule, as well as enzyme-linked assayswhich rely on detecting an enzymatic activity associated with the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein ortarget molecule.

[0935] Alternatively, cell free assays can be conducted in a liquidphase. In such an assay, the reaction products are separated fromunreacted components, by any of a number of standard techniques,including but not limited to: differential centrifugation (see, forexample, Rivas and Minton (1993) Trends Biochem Sci 18:284-7);chromatography (gel filtration chromatography, ion-exchangechromatography); electrophoresis (see, e.g., Ausubel et al., eds. (1999)Current Protocols in Molecular Biology, J. Wiley, New York.); andimmunoprecipitation (see, for example, Ausubel et al., eds. (1999)Current Protocols in Molecular Biology, J. Wiley, New York). Such resinsand chromatographic techniques are known to one skilled in the art (see,e.g., Heegaard (1998) J Mol Recognit 11: 141-8; Hage and Tweed (1997) JChromatogr B Biomed Sci Appl. 699:499-525). Further, fluorescence energytransfer can also be conveniently utilized, as described herein, todetect binding without further purification of the complex fromsolution.

[0936] In a preferred embodiment, the assay includes contacting the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein or biologically active portion thereof with a known compoundwhich binds 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 to form an assay mixture, contacting the assay mixture with a testcompound, and determining the ability of the test compound to interactwith a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein, wherein determining the ability of the test compound tointeract with a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein includes determining the ability of the testcompound to preferentially bind to 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 or biologically active portionthereof, or to modulate the activity of a target molecule, as comparedto the known compound.

[0937] The target gene products of the invention can, in vivo, interactwith one or more cellular or extracellular macromolecules, such asproteins. For the purposes of this discussion, such cellular andextracellular macromolecules are referred to herein as “bindingpartners.” Compounds that disrupt such interactions can be useful inregulating the activity of the target gene product. Such compounds caninclude, but are not limited to molecules such as antibodies, peptides,and small molecules. The preferred target genes/products for use in thisembodiment are the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 genes herein identified. In an alternativeembodiment, the invention provides methods for determining the abilityof the test compound to modulate the activity of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein throughmodulation of the activity of a downstream effector of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 target molecule.For example, the activity of the effector molecule on an appropriatetarget can be determined, or the binding of the effector to anappropriate target can be determined, as previously described.

[0938] To identify compounds that interfere with the interaction betweenthe target gene product and its cellular or extracellular bindingpartner(s), a reaction mixture containing the target gene product andthe binding partner is prepared, under conditions and for a timesufficient, to allow the two products to form complex. In order to testan inhibitory agent, the reaction mixture is provided in the presenceand absence of the test compound. The test compound can be initiallyincluded in the reaction mixture, or can be added at a time subsequentto the addition of the target gene and its cellular or extracellularbinding partner. Control reaction mixtures are incubated without thetest compound or with a placebo. The formation of any complexes betweenthe target gene product and the cellular or extracellular bindingpartner is then detected. The formation of a complex in the controlreaction, but not in the reaction mixture containing the test compound,indicates that the compound interferes with the interaction of thetarget gene product and the interactive binding partner.

[0939] Additionally, complex formation within reaction mixturescontaining the test compound and normal target gene product can also becompared to complex formation within reaction mixtures containing thetest compound and mutant target gene product. This comparison can beimportant in those cases wherein it is desirable to identify compoundsthat disrupt interactions of mutant but not normal target gene products.

[0940] These assays can be conducted in a heterogeneous or homogeneousformat. Heterogeneous assays involve anchoring either the target geneproduct or the binding partner onto a solid phase, and detectingcomplexes anchored on the solid phase at the end of the reaction. Inhomogeneous assays, the entire reaction is carried out in a liquidphase. In either approach, the order of addition of reactants can bevaried to obtain different information about the compounds being tested.For example, test compounds that interfere with the interaction betweenthe target gene products and the binding partners, e.g., by competition,can be identified by conducting the reaction in the presence of the testsubstance. Alternatively, test compounds that disrupt preformedcomplexes, e.g., compounds with higher binding constants that displaceone of the components from the complex, can be tested by adding the testcompound to the reaction mixture after complexes have been formed. Thevarious formats are briefly described below.

[0941] In a heterogeneous assay system, either the target gene productor the interactive cellular or extracellular binding partner, isanchored onto a solid surface (e.g., a microtiter plate), while thenon-anchored species is labeled, either directly or indirectly. Theanchored species can be immobilized by non-covalent or covalentattachments. Alternatively, an immobilized antibody specific orselective for the species to be anchored can be used to anchor thespecies to the solid surface.

[0942] In order to conduct the assay, the partner of the immobilizedspecies is exposed to the coated surface with or without the testcompound. After the reaction is complete, unreacted components areremoved (e.g., by washing) and any complexes formed will remainimmobilized on the solid surface. Where the non-immobilized species ispre-labeled, the detection of label immobilized on the surface indicatesthat complexes were formed. Where the non-immobilized species is notpre-labeled, an indirect label can be used to detect complexes anchoredon the surface; e.g., using a labeled antibody specific or selective forthe initially non-immobilized species (the antibody, in turn, can bedirectly labeled or indirectly labeled with, e.g., a labeled anti-Igantibody). Depending upon the order of addition of reaction components,test compounds that inhibit complex formation or that disrupt preformedcomplexes can be detected.

[0943] Alternatively, the reaction can be conducted in a liquid phase inthe presence or absence of the test compound, the reaction productsseparated from unreacted components, and complexes detected; e.g., usingan immobilized antibody specific or selective for one of the bindingcomponents to anchor any complexes formed in solution, and a labeledantibody specific or selective for the other partner to detect anchoredcomplexes. Again, depending upon the order of addition of reactants tothe liquid phase, test compounds that inhibit complex or that disruptpreformed complexes can be identified.

[0944] In an alternate embodiment of the invention, a homogeneous assaycan be used. For example, a preformed complex of the target gene productand the interactive cellular or extracellular binding partner product isprepared in that either the target gene products or their bindingpartners are labeled, but the signal generated by the label is quencheddue to complex formation (see, e.g., U.S. Pat. No. 4,109,496 thatutilizes this approach for immunoassays). The addition of a testsubstance that competes with and displaces one of the species from thepreformed complex will result in the generation of a signal abovebackground. In this way, test substances that disrupt target geneproduct-binding partner interaction can be identified.

[0945] In yet another aspect, the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 proteins can be used as “baitproteins” in a two-hybrid assay or three-hybrid assay (see, e.g., U.S.Pat. No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al.(1993) J. Biol. Chem. 268:12046-12054; Bartel et al. (1993)Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696;and Brent WO94/10300), to identify other proteins, which bind to orinteract with 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 (“21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593-binding proteins” or “21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593-bp”) and are involved in 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity. Such21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-bpscan be activators or inhibitors of signals by the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 proteins or 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 targetsas, for example, downstream elements of a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-mediated signaling pathway.

[0946] The two-hybrid system is based on the modular nature of mosttranscription factors, which consist of separable DNA-binding andactivation domains. Briefly, the assay utilizes two different DNAconstructs. In one construct, the gene that codes for a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein is fusedto a gene encoding the DNA binding domain of a known transcriptionfactor (e.g., GAL-4). In the other construct, a DNA sequence, from alibrary of DNA sequences, that encodes an unidentified protein (“prey”or “sample”) is fused to a gene that codes for the activation domain ofthe known transcription factor. (Alternatively the: 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein can be the fusedto the activator domain.) If the “bait” and the “prey” proteins are ableto interact, in vivo, forming a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593-dependent complex, the DNA-binding andactivation domains of the transcription factor are brought into closeproximity. This proximity allows transcription of a reporter gene (e.g.,lacZ) which is operably linked to a transcriptional regulatory siteresponsive to the transcription factor. Expression of the reporter genecan be detected and cell colonies containing the functionaltranscription factor can be isolated and used to obtain the cloned genewhich encodes the protein which interacts with the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein.

[0947] In another embodiment, modulators of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 expression are identified. Forexample, a cell or cell free mixture is contacted with a candidatecompound and the expression of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 mRNA or protein evaluated relative to thelevel of expression of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 mRNA or protein in the absence of the candidatecompound. When expression of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 mRNA or protein is greater in the presence ofthe candidate compound than in its absence, the candidate compound isidentified as a stimulator of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 mRNA or protein expression. Alternatively,when expression of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 mRNA or protein is less (statistically significantlyless) in the presence of the candidate compound than in its absence, thecandidate compound is identified as an inhibitor of 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA or proteinexpression. The level of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 mRNA or protein expression can be determined bymethods described herein for detecting 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 mRNA or protein.

[0948] In another aspect, the invention pertains to a combination of twoor more of the assays described herein. For example, a modulating agentcan be identified using a cell-based or a cell free assay, and theability of the agent to modulate the activity of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein can be confirmedin vivo, e.g., in an animal such as an animal model for aberrant ordeficient guanylate kinase activity, phophatidylinositol 4-phosphate5-kinase activity, kinase activity, transferase activity, aminopeptidaseactivity, adenylate cyclase activity, calpain protease activity,oxidoreductase activity, neprilysin protease activity, AMP bindingenzyme activity and lysyl oxidase activity, or other activity.

[0949] This invention further pertains to novel agents identified by theabove-described screening assays. Accordingly, it is within the scope ofthis' invention to further use an agent identified as described herein(e.g., a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 modulating agent, an antisense 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 nucleic acid molecule, a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-specificantibody, or a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593-binding partner) in an appropriate animal model todetermine the efficacy, toxicity, side effects, or mechanism of action,of treatment with such an agent. Furthermore, novel agents identified bythe above-described screening assays can be used for treatments asdescribed herein.

[0950] Detection Assays

[0951] Portions or fragments of the nucleic acid sequences identifiedherein can be used as polynucleotide reagents. For example, thesesequences can be used to: (i) map their respective genes on a chromosomee.g., to locate gene regions associated with genetic disease or toassociate 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 with a disease; (ii) identify an individual from a minute biologicalsample (tissue typing); and (iii) aid in forensic identification of abiological sample. These applications are described in the subsectionsbelow.

[0952] Chromosome Mapping

[0953] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleotide sequences or portions thereof can be used to map thelocation of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 genes on a chromosome. This process is called chromosomemapping. Chromosome mapping is useful in correlating the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequences withgenes associated with disease.

[0954] Briefly, 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 genes can be mapped to chromosomes by preparing PCR primers(preferably 15-25 bp in length) from the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 nucleotide sequences. Theseprimers can then be used for PCR screening of somatic cell hybridscontaining individual human chromosomes. Only those hybrids containingthe human gene corresponding to the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 5663.8, 18610, 33217,21967, h1983, m1983, 38555 or 593 sequences will yield an amplifiedfragment.

[0955] A panel of somatic cell hybrids in which each cell line containseither a single human chromosome or a small number of human chromosomes,and a full set of mouse chromosomes, can allow easy mapping ofindividual genes to specific human chromosomes. (D'Eustachio et al.(1983) Science 220:919-924).

[0956] Other mapping strategies e.g., in situ hybridization (describedin Fan et al. (1990) Proc. Natl. Acad. Sci. USA, 87:6223-27),pre-screening with labeled flow-sorted chromosomes, and pre-selection byhybridization to chromosome specific cDNA libraries can be used to map21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 toa chromosomal location.

[0957] Fluorescence in situ hybridization (FISH) of a DNA sequence to ametaphase chromosomal spread can further be used to provide a precisechromosomal location in one step. The FISH technique can be used with aDNA sequence as short as 500 or 600 bases. However, clones larger than1,000 bases have a higher likelihood of binding to a unique chromosomallocation with sufficient signal intensity for simple detection.Preferably 1,000 bases, and more preferably 2,000 bases will suffice toget good results at a reasonable amount of time. For a review of thistechnique, see Verma et al. (1988) Human Chromosomes: A Manual of BasicTechniques, Pergamon Press, New York).

[0958] Reagents for chromosome mapping can be used individually to marka single chromosome or a single site on that chromosome, or panels ofreagents can be used for marking multiple sites and/or multiplechromosomes. Reagents corresponding to noncoding regions of the genesactually are preferred for mapping purposes. Coding sequences are morelikely to be conserved within gene families, thus increasing the chanceof cross hybridizations during chromosomal mapping.

[0959] Once a sequence has been mapped to a precise chromosomallocation, the physical position of the sequence on the chromosome can becorrelated with genetic map data. (Such data are found, for example, inMcKusick, Mendelian Inheritance in Man, available on-line through JohnsHopkins University Welch Medical Library). The relationship between agene and a disease, mapped to the same chromosomal region, can then beidentified through linkage analysis (co-inheritance of physicallyadjacent genes), described in, for example, Egeland et al. (1987)Nature, 325:783-787.

[0960] Moreover, differences in the DNA sequences between individualsaffected and unaffected with a disease associated with the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, can bedetermined. If a mutation is observed in some or all of the affectedindividuals but not in any unaffected individuals, then the mutation islikely to be the causative agent of the particular disease. Comparisonof affected and unaffected individuals generally involves first lookingfor structural alterations in the chromosomes, such as deletions ortranslocations that are visible from chromosome spreads or detectableusing PCR based on that DNA sequence. Ultimately, complete sequencing ofgenes from several individuals can be performed to confirm the presenceof a mutation and to distinguish mutations from polymorphisms.

[0961] Tissue Typing

[0962] 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 sequences can be used to identify individuals from biologicalsamples using, e.g., restriction fragment length polymorphism (RFLP). Inthis technique, an individual's genomic DNA is digested with one or morerestriction enzymes, the fragments separated, e.g., in a Southern blot,and probed to yield bands for identification. The sequences of thepresent invention are useful as additional DNA markers for RFLP(described in U.S. Pat. No. 5,272,057).

[0963] Furthermore, the sequences of the present invention can also beused to determine the actual base-by-base DNA sequence of selectedportions of an individual's genome. Thus, the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 nucleotide sequences describedherein can be used to prepare two PCR primers from the 5′ and 3′ ends ofthe sequences. These primers can then be used to amplify an individual'sDNA and subsequently sequence it. Panels of corresponding DNA sequencesfrom individuals, prepared in this manner, can provide unique individualidentifications, as each individual will have a unique set of such DNAsequences due to allelic differences.

[0964] Allelic variation occurs to some degree in the coding regions ofthese sequences, and to a greater degree in the noncoding regions. Eachof the sequences described herein can, to some degree, be used as astandard against which DNA from an individual can be compared foridentification purposes. Because greater numbers of polymorphisms occurin the noncoding regions, fewer sequences are necessary to differentiateindividuals. The noncoding sequences of SEQ ID NO:1, 5, 10, 18, 21, 24,31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111 can providepositive individual identification with a panel of perhaps 10 to 1,000primers which each yield a noncoding amplified sequence of 100 bases. Ifpredicted coding sequences, such as those in SEQ ID NO:3, 7, 12, 20, 23,26, 33, 41, 45, 48, 51, 56, 59, 65, 68, 73, 90, 106, 109 or 113 areused, a more appropriate number of primers for positive individualidentification would be 5002,000.

[0965] If a panel of reagents from 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 nucleotide sequences described hereinis used to generate a unique identification database for an individual,those same reagents can later be used to identify tissue from thatindividual. Using the unique identification database, positiveidentification of the individual, living or dead, can be made fromextremely small tissue samples.

[0966] Use of Partial 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 Sequences in Forensic Biology

[0967] DNA-based identification techniques can also be used in forensicbiology. To make such an identification, PCR technology can be used toamplify DNA sequences taken from very small biological samples such astissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, orsemen found at a crime scene. The amplified sequence can then becompared to a standard, thereby allowing identification of the origin ofthe biological sample.

[0968] The sequences of the present invention can be used to providepolynucleotide reagents, e.g., PCR primers, targeted to specific loci inthe human genome, which can enhance the reliability of DNA-basedforensic identifications by, for example, providing another“identification marker” (i.e. another DNA sequence that is unique to aparticular individual). As mentioned above, actual base sequenceinformation can be used for identification as an accurate alternative topatterns formed by restriction enzyme generated fragments. Sequencestargeted to noncoding regions of SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39,43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111 (e.g., fragmentsderived from the noncoding regions of SEQ ID NO:1, 5, 10, 18, 21, 24,31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111 having alength of at least 20 bases, preferably at least 30 bases) areparticularly appropriate for this use.

[0969] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleotide sequences described herein can further be used to providepolynucleotide reagents, e.g., labeled or labelable probes which can beused in, for example, an in situ hybridization technique, to identify aspecific tissue. This can be very useful in cases where a forensicpathologist is presented with a tissue of unknown origin. Panels of such21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593probes can be used to identify tissue by species and/or by organ type.

[0970] In a similar fashion, these reagents, e.g., 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 primers or probes can beused to screen tissue culture for contamination (i.e. screen for thepresence of a mixture of different types of cells in a culture).

[0971] Predictive Medicine

[0972] The present invention also pertains to the field of predictivemedicine in which diagnostic assays, prognostic assays, and monitoringclinical trials are used for prognostic (predictive) purposes to therebytreat an individual.

[0973] Generally, the invention provides, a method of determining if asubject is at risk for a disorder related to a lesion in or themisexpression of a gene which encodes 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593.

[0974] Such disorders include, e.g., a disorder associated with themisexpression of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene; cellular proliferative and/or differentiativedisorders, brain disorders, platelet disorders, breast disorders, colondisorders, kidney (renal) disorders, lung disorders, ovarian disorders,prostate disorders, cervical disorders, spleen disorders, thymusdisorders, thyroid disorders, testes disorders, hematopoeitic disorders,pancreatic disorders, skeletal muscle disorders, skin (dermal)disorders, disorders associated with bone metabolism, immune, e.g.,inflammatory, disorders, cardiovascular disorders, endothelial celldisorders, liver disorders, viral diseases, pain disorders, metabolicdisorders, neurological or CNS disorders, erythroid disorders, bloodvessel disorders or angiogenic disorders.

[0975] The method includes one or more of the following: detecting, in atissue of the subject, the presence or absence of a mutation whichaffects the expression of the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 gene, or detecting the presence or absence ofa mutation in a region which controls the expression of the gene, e.g.,a mutation in the 5′ control region; detecting, in a tissue of thesubject, the presence or absence of a mutation which alters thestructure of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene; detecting, in a tissue of the subject, themisexpression of the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 gene, at the mRNA level, e.g., detecting a non-wildtype level of an mRNA; or detecting, in a tissue of the subject, themisexpression of the gene, at the protein level, e.g., detecting anon-wild type level of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide.

[0976] In preferred embodiments the method includes: ascertaining theexistence of at least one of: a deletion of one or more nucleotides fromthe 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene; an insertion of one or more nucleotides into the gene, a pointmutation, e.g., a substitution of one or more nucleotides of the gene, agross chromosomal rearrangement of the gene, e.g., a translocation,inversion, or deletion.

[0977] For example, detecting the genetic lesion can include: (i)providing a probe/primer including an oligonucleotide containing aregion of nucleotide sequence which hybridizes to a sense or antisensesequence from SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39, 43, 46, 49, 54,57, 63, 66, 71, 88, 104, 107 or 111, or naturally occurring mutantsthereof or 5′ or 3′ flanking sequences naturally associated with the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene; (ii) exposing the probe/primer to nucleic acid of the tissue; anddetecting, by hybridization, e.g., in situ hybridization, of theprobe/primer to the nucleic acid, the presence or absence of the geneticlesion.

[0978] In preferred embodiments detecting the misexpression includesascertaining the existence of at least one of: an alteration in thelevel of a messenger RNA transcript of the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 gene; the presence of anon-wild type splicing pattern of a messenger RNA transcript of thegene; or a non-wild type level of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593.

[0979] Methods of the invention can be used prenatally or to determineif a subject's offspring will be at risk for a disorder.

[0980] In preferred embodiments the method includes determining thestructure of a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene, an abnormal structure being indicative of risk forthe disorder.

[0981] In preferred embodiments the method includes contacting a samplefrom the subject with an antibody to the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein or a nucleic acid,which hybridizes specifically with the gene. These and other embodimentsare discussed below.

[0982] Diagnostic and Prognostic Assays

[0983] The presence, level, or absence of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein or nucleic acid in abiological sample can be evaluated by obtaining a biological sample froma test subject and contacting the biological sample with a compound oran agent capable of detecting 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein or nucleic acid (e.g., mRNA, genomicDNA) that encodes 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein such that the presence of 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein or nucleic acidis detected in the biological sample. The term “biological sample”includes tissues, cells and biological fluids isolated from a subject,as well as tissues, cells and fluids present within a subject. Apreferred biological sample is serum. The level of expression of the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene can be measured in a number of ways, including, but not limited to:measuring the mRNA encoded by the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 genes; measuring the amount of proteinencoded by the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 genes; or measuring the activity of the protein encoded bythe 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593genes.

[0984] The level of mRNA corresponding to the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 gene in a cell can bedetermined both by in situ and by in vitro formats.

[0985] The isolated mRNA can be used in hybridization or amplificationassays that include, but are not limited to, Southern or Northernanalyses, polymerase chain reaction analyses and probe arrays. Onepreferred diagnostic method for the detection of mRNA levels involvescontacting the isolated mRNA with a nucleic acid molecule (probe) thatcan hybridize to the mRNA encoded by the gene being detected. Thenucleic acid probe can be, for example, a full-length 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acid,such as the nucleic acid of SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39, 43,46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111, or a portion thereof,such as an oligonucleotide of at least 7, 15, 30, 50, 100, 250 or 500nucleotides in length and sufficient to specifically hybridize understringent conditions to 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 mRNA or genomic DNA. Other suitable probes for usein the diagnostic assays are described herein.

[0986] In one format, mRNA (or cDNA) is immobilized on a surface andcontacted with the probes, for example by running the isolated mRNA onan agarose gel and transferring the mRNA from the gel to a membrane,such as nitrocellulose. In an alternative format, the probes areimmobilized on a surface and the mRNA (or cDNA) is contacted with theprobes, for example, in a two-dimensional gene chip array. A skilledartisan can adapt known mRNA detection methods for use in detecting thelevel of mRNA encoded by the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 genes.

[0987] The level of mRNA in a sample that is encoded by one of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 can beevaluated with nucleic acid amplification, e.g., by rtPCR (Mullis (1987)U.S. Pat. No. 4,683,202), ligase chain reaction (Barany (1991) Proc.Natl. Acad. Sci. USA 88:189-193), self sustained sequence replication(Guatelli et al., (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878),transcriptional amplification system (Kwoh et al., (1989), Proc. Natl.Acad. Sci. USA 86:1173-1177), Q Beta Replicase (Lizardi et al., (1988)Bio/Technology 6:1197), rolling circle replication (Lizardi et al., U.S.Pat. No. 5,854,033) or any other nucleic acid amplification method,followed by the detection of the amplified molecules using techniquesknown in the art. As used herein, amplification primers are defined asbeing a pair of nucleic acid molecules that can anneal to 5′ or 3′regions of a gene (plus and minus strands, respectively, or vice-versa)and contain a short region in between. In general, amplification primersare from about 10 to 30 nucleotides in length and flank a region fromabout 50 to 200 nucleotides in length. Under appropriate conditions andwith appropriate reagents, such primers permit the amplification of anucleic acid molecule comprising the nucleotide sequence flanked by theprimers.

[0988] For in situ methods, a cell or tissue sample can beprepared/processed and immobilized on a support, typically a glassslide, and then contacted with a probe that can hybridize to mRNA thatencodes the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene being analyzed.

[0989] In another embodiment, the methods further contacting a controlsample with a compound or agent capable of detecting 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA, or genomicDNA, and comparing the presence of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 mRNA or genomic DNA in the controlsample with the presence of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 mRNA or genomic DNA in the test sample.

[0990] A variety of methods can be used to determine the level ofprotein encoded by 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593. In general, these methods include contacting anagent that selectively binds to the protein, such as an antibody with asample, to evaluate the level of protein in the sample. In a preferredembodiment, the antibody bears a detectable label. Antibodies can bepolyclonal, or more preferably, monoclonal. An intact antibody, or afragment thereof (e.g., Fab or F(ab′)₂) can be used. The term “labeled”,with regard to the probe or antibody, is intended to encompass directlabeling of the probe or antibody by coupling (i.e., physically linking)a detectable substance to the probe or antibody, as well as indirectlabeling of the probe or antibody by reactivity with a detectablesubstance. Examples of detectable substances are provided herein.

[0991] The detection methods can be used to detect 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein in a biologicalsample in vitro as well as in vivo. In vitro techniques for detection of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein include enzyme linked immunosorbent assays (ELISAs),immunoprecipitations, immunofluorescence, enzyme immunoassay (EIA),radioimmunoassay (RIA), and Western blot analysis. In vivo techniquesfor detection of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein include introducing into a subject a labeledanti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593antibody. For example, the antibody can be labeled with a radioactivemarker whose presence and location in a subject can be detected bystandard imaging techniques.

[0992] In another embodiment, the methods further include contacting thecontrol sample with a compound or agent capable of detecting 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein,and comparing the presence of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein in the control sample with thepresence of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein in the test sample.

[0993] The invention also includes kits for detecting the presence of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 ina biological sample. For example, the kit can include a compound oragent capable of detecting 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein or mRNA in a biological sample; and astandard. The compound or agent can be packaged in a suitable container.The kit can further comprise instructions for using the kit to detect21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein or nucleic acid.

[0994] For antibody-based kits, the kit can include: (1) a firstantibody (e.g., attached to a solid support) which binds to apolypeptide corresponding to a marker of the invention; and, optionally,(2) a second, different antibody which binds to either the polypeptideor the first antibody and is conjugated to a detectable agent.

[0995] For oligonucleotide-based kits, the kit can include: (1) anoligonucleotide, e.g., a detectably labeled oligonucleotide, whichhybridizes to a nucleic acid sequence encoding a polypeptidecorresponding to a marker of the invention or (2) a pair of primersuseful for amplifying a nucleic acid molecule corresponding to a markerof the invention. The kit can also includes a buffering agent, apreservative, or a protein stabilizing agent. The kit can also includescomponents necessary for detecting the detectable agent (e.g., an enzymeor a substrate). The kit can also contain a control sample or a seriesof control samples which can be assayed and compared to the test samplecontained. Each component of the kit can be enclosed within anindividual container and all of the various containers can be within asingle package, along with instructions for interpreting the results ofthe assays performed using the kit.

[0996] The diagnostic methods described herein can identify subjectshaving, or at risk of developing, a disease or disorder associated withmisexpressed or aberrant or unwanted 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 expression or activity. As usedherein, the term “unwanted” includes an unwanted phenomenon involved ina biological response such as pain or deregulated cell proliferation.

[0997] In one embodiment, a disease or disorder associated with aberrantor unwanted 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 expression or activity is identified. A test sample is obtained froma subject and 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or nucleic acid (e.g., mRNA or genomic DNA) isevaluated, wherein the level, e.g., the presence or absence, of 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 protein ornucleic acid is diagnostic for a subject having or at risk of developinga disease or disorder associated with aberrant or unwanted 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 expression oractivity. As used herein, a “test sample” refers to a biological sampleobtained from a subject of interest, including a biological fluid (e.g.,serum), cell sample, or tissue.

[0998] The prognostic assays described herein can be used to determinewhether a subject can be administered an agent (e.g., an agonist,antagonist, peptidomimetic, protein, peptide, nucleic acid, smallmolecule, or other drug candidate) to treat a disease or disorderassociated with aberrant or unwanted 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 expression or activity. For example,such methods can be used to determine whether a subject can beeffectively treated with an agent for a cellular proliferative and/ordifferentiative disorder, brain disorder, platelet disorder, breastdisorder, colon disorder, kidney (renal) disorder, lung disorder,ovarian disorder, prostate disorder, cervical disorder, spleen disorder,thymus disorder, thyroid disorder, testes disorder, hematopoeiticdisorder, pancreatic disorder, skeletal muscle disorder, skin (dermal)disorder, disorder associated with bone metabolism, immune, e.g.,inflammatory, disorder, cardiovascular disorder, endothelial celldisorder, liver disorder, viral disease, pain disorder, metabolicdisorder, neurological or CNS disorder, erythroid disorder, blood vesseldisorder or angiogenic disorder.

[0999] The methods of the invention can also be used to detect geneticalterations in a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene, thereby determining if a subject with the alteredgene is at risk for a disorder characterized by misregulation in 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 proteinactivity or nucleic acid expression, such as a cellular proliferativeand/or differentiative disorder, brain disorder, platelet disorder,breast disorder, colon disorder, kidney (renal) disorder, lung disorder,ovarian disorder, prostate disorder, cervical disorder, spleen disorder,thymus disorder, thyroid disorder, testes disorder, hematopoeiticdisorder, pancreatic disorder, skeletal muscle disorder, skin (dermal)disorder, disorder associated with bone metabolism, immune, e.g.,inflammatory, disorder, cardiovascular disorder, endothelial celldisorder, liver disorder, viral disease, pain disorder, metabolicdisorder, neurological or CNS disorder, erythroid disorder, blood vesseldisorder or angiogenic disorder. In preferred embodiments, the methodsinclude detecting, in a sample from the subject, the presence or absenceof a genetic alteration characterized by at least one of an alterationaffecting the integrity of a gene encoding a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-protein, or the mis-expressionof the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene. For example, such genetic alterations can be detected byascertaining the existence of at least one of 1) a deletion of one ormore nucleotides from a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 gene; 2) an addition of one or more nucleotides to a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene; 3) a substitution of one or more nucleotides of a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, 4) achromosomal rearrangement of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 gene; 5) an alteration in the level of amessenger RNA transcript of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 gene, 6) aberrant modification of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, suchas of the methylation pattern of the genomic DNA, 7) the presence of anon-wild type splicing pattern of a messenger RNA transcript of a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, 8) anon-wild type level of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593-protein, 9) allelic loss of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, and 10)inappropriate post-translational modification of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593-protein.

[1000] An alteration can be detected without a probe/primer in apolymerase chain reaction, such as anchor PCR or RACE PCR, or,alternatively, in a ligation chain reaction (LCR), the latter of whichcan be particularly useful for detecting point mutations in the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-gene. Thismethod can include the steps of collecting a sample of cells from asubject, isolating nucleic acid (e.g., genomic, mRNA or both) from thesample, contacting the nucleic acid sample with one or more primerswhich specifically hybridize to a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 gene under conditions such thathybridization and amplification of the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 gene (if present) occurs, anddetecting the presence or absence of an amplification product, ordetecting the size of the amplification product and comparing the lengthto a control sample. It is anticipated that PCR and/or LCR may bedesirable to use as a preliminary amplification step in conjunction withany of the techniques used for detecting mutations described herein.Alternatively, other amplification methods described herein or known inthe art can be used.

[1001] In another embodiment, mutations in a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 gene from a sample cell can beidentified by detecting alterations in restriction enzyme cleavagepatterns. For example, sample and control DNA is isolated, amplified(optionally), digested with one or more restriction endonucleases, andfragment length sizes are determined, e.g., by gel electrophoresis andcompared. Differences in fragment length sizes between sample andcontrol DNA indicates mutations in the sample DNA. Moreover, the use ofsequence specific ribozymes (see, for example, U.S. Pat. No. 5,498,531)can be used to score for the presence of specific mutations bydevelopment or loss of a ribozyme cleavage site.

[1002] In other embodiments, genetic mutations in 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 can be identified byhybridizing a sample and control nucleic acids, e.g., DNA or RNA, twodimensional arrays, e.g., chip based arrays. Such arrays include aplurality of addresses, each of which is positionally distinguishablefrom the other. A different probe is located at each address of theplurality. The arrays can have a high density of addresses, e.g., cancontain hundreds or thousands of oligonucleotides probes (Cronin et al.(1996) Human Mutation 7: 244-255; Kozal et al. (1996) Nature Medicine 2:753-759). For example, genetic mutations in 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 can be identified in twodimensional arrays containing light-generated DNA probes as described inCronin, M. T. et al. supra. Briefly, a first hybridization array ofprobes can be used to scan through long stretches of DNA in a sample andcontrol to identify base changes between the sequences by making lineararrays of sequential overlapping probes. This step allows theidentification of point mutations. This step is followed by a secondhybridization array that allows the characterization of specificmutations by using smaller, specialized probe arrays complementary toall variants or mutations detected. Each mutation array is composed ofparallel probe sets, one complementary to the wild-type gene and theother complementary to the mutant gene.

[1003] In yet another embodiment, any of a variety of sequencingreactions known in the art can be used to directly sequence the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 gene anddetect mutations by comparing the sequence of the sample 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 with thecorresponding wild-type (control) sequence. Automated sequencingprocedures can be utilized when performing the diagnostic assays (Naeveet al. (1995) Biotechniques 19:448-53), including sequencing by massspectrometry.

[1004] Other methods for detecting mutations in the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 gene include methods inwhich protection from cleavage agents is used to detect mismatched basesin RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1985) Science230:1242; Cotton et al. (1988) Proc. Natl Acad Sci USA 85:4397; Saleebaet al. (1992) Methods Enzymol. 217:286-295).

[1005] In still another embodiment, the mismatch cleavage reactionemploys one or more proteins that recognize mismatched base pairs indouble-stranded DNA (so called “DNA mismatch repair” enzymes) in definedsystems for detecting and mapping point mutations in 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 cDNAs obtainedfrom samples of cells. For example, the mutY enzyme of E. coli cleaves Aat G/A mismatches and the thymidine DNA glycosylase from HeLa cellscleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis15:1657-1662; U.S. Pat. No. 5,459,039).

[1006] In other embodiments, alterations in electrophoretic mobilitywill be used to identify mutations in 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 genes. For example, single strandconformation polymorphism (SSCP) can be used to detect differences inelectrophoretic mobility between mutant and wild type nucleic acids(Orita et al. (1989) Proc Natl. Acad. Sci USA: 86:2766, see also Cotton(1993) Mutat. Res. 285:125-144; and Hayashi (1992) Genet. Anal. Tech.Appl. 9:73-79). Single-stranded DNA fragments of sample and control21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593nucleic acids will be denatured and allowed to renature. The secondarystructure of single-stranded nucleic acids varies according to sequence,the resulting alteration in electrophoretic mobility enables thedetection of even a single base change. The DNA fragments can be labeledor detected with labeled probes. The sensitivity of the assay can beenhanced by using RNA (rather than DNA), in which the secondarystructure is more sensitive to a change in sequence. In a preferredembodiment, the subject method utilizes heteroduplex analysis toseparate double stranded heteroduplex molecules on the basis of changesin electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5).

[1007] In yet another embodiment, the movement of mutant or wild-typefragments in polyacrylamide gels containing a gradient of denaturant isassayed using denaturing gradient gel electrophoresis (DGGE) (Myers etal. (1985) Nature 313:495). When DGGE is used as the method of analysis,DNA will be modified to insure that it does not completely denature, forexample by adding a GC clamp of approximately 40 bp of high-meltingGC-rich DNA by PCR. In a further embodiment, a temperature gradient isused in place of a denaturing gradient to identify differences in themobility of control and sample DNA (Rosenbaum and Reissner (1987)Biophys Chem 265:12753).

[1008] Examples of other techniques for detecting point mutationsinclude, but are not limited to, selective oligonucleotidehybridization, selective amplification, or selective primer extension(Saiki et al. (1986) Nature 324:163); Saiki et al. (1989) Proc. NatlAcad. Sci USA 86:6230).

[1009] Alternatively, allele specific amplification technology whichdepends on selective PCR amplification can be used in conjunction withthe instant invention. Oligonucleotides used as primers for specificamplification can carry the mutation of interest in the center of themolecule (so that amplification depends on differential hybridization)(Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme3′ end of one primer where, under appropriate conditions, mismatch canprevent, or reduce polymerase extension (Prossner (1993) Tibtech11:238). In addition it may be desirable to introduce a novelrestriction site in the region of the mutation to create cleavage-baseddetection (Gasparini et al. (1992) Mol. Cell Probes 6:1). It isanticipated that in certain embodiments amplification can also beperformed using Taq ligase for amplification (Barany (1991) Proc. Natl.Acad. Sci USA 88:189-93). In such cases, ligation will occur only ifthere is a perfect match at the 3′ end of the 5′ sequence making itpossible to detect the presence of a known mutation at a specific siteby looking for the presence or absence of amplification.

[1010] The methods described herein can be performed, for example, byutilizing pre-packaged diagnostic kits comprising at least one probenucleic acid or antibody reagent described herein, which can beconveniently used, e.g., in clinical settings to diagnose patientsexhibiting symptoms or family history of a disease or illness involvinga 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene.

[1011] Use of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 Molecules as Surrogate Markers

[1012] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecules of the invention are also useful as markers of disordersor disease states, as markers for precursors of disease states, asmarkers for predisposition of disease states, as markers of drugactivity, or as markers of the pharmacogenomic profile of a subject.Using the methods described herein, the presence, absence and/orquantity of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 molecules of the invention can be detected, and can becorrelated with one or more biological states in vivo. For example, the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593molecules of the invention can serve as surrogate markers for one ormore disorders or disease states or for conditions leading up to diseasestates. As used herein, a “surrogate marker” is an objective biochemicalmarker which correlates with the absence or presence of a disease ordisorder, or with the progression of a disease or disorder (e.g., withthe presence or absence of a tumor). The presence or quantity of suchmarkers is independent of the disease. Therefore, these markers canserve to indicate whether a particular course of treatment is effectivein lessening a disease state or disorder. Surrogate markers are ofparticular use when the presence or extent of a disease state ordisorder is difficult to assess through standard methodologies (e.g.,early stage tumors), or when an assessment of disease progression isdesired before a potentially dangerous clinical endpoint is reached(e.g., an assessment of cardiovascular disease can be made usingcholesterol levels as a surrogate marker, and an analysis of HIVinfection can be made using HIV RNA levels as a surrogate marker, wellin advance of the undesirable clinical outcomes of myocardial infarctionor fully-developed AIDS). Examples of the use of surrogate markers inthe art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264;and James (1994) AIDS Treatment News Archive 209.

[1013] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecules of the invention are also useful as pharmacodynamicmarkers. As used herein, a “pharmacodynamic marker” is an objectivebiochemical marker which correlates specifically with drug effects. Thepresence or quantity of a pharmacodynamic marker is not related to thedisease state or disorder for which the drug is being administered;therefore, the presence or quantity of the marker is indicative of thepresence or activity of the drug in a subject. For example, apharmacodynamic marker can be indicative of the concentration of thedrug in a biological tissue, in that the marker is either expressed ortranscribed or not expressed or transcribed in that tissue inrelationship to the level of the drug. In this fashion, the distributionor uptake of the drug can be monitored by the pharmacodynamic marker.Similarly, the presence or quantity of the pharmacodynamic marker can berelated to the presence or quantity of the metabolic product of a drug,such that the presence or quantity of the marker is indicative of therelative breakdown rate of the drug in vivo. Pharmacodynamic markers areof particular use in increasing the sensitivity of detection of drugeffects, particularly when the drug is administered in low doses. Sinceeven a small amount of a drug can be sufficient to activate multiplerounds of marker (e.g., a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 marker) transcription or expression, theamplified marker can be in a quantity which is more readily detectablethan the drug itself. Also, the marker can be more easily detected dueto the nature of the marker itself; for example, using the methodsdescribed herein, anti-21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 antibodies can be employed in an immune-baseddetection system for a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 protein marker, or 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593-specific radiolabeled probes can beused to detect a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 mRNA marker. Furthermore, the use of a pharmacodynamicmarker can offer mechanism-based prediction of risk due to drugtreatment beyond the range of possible direct observations. Examples ofthe use of pharmacodynamic markers in the art include: Matsuda et al.U.S. Pat. No. 6,033,862; Hattis et al. (1991) Env. Health Perspect. 90:229-238; Schentag (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3:S21-S24; and Nicolau (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3:S16-S20.

[1014] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecules of the invention are also useful as pharmacogenomicmarkers. As used herein, a “pharmacogenomic marker” is an objectivebiochemical marker which correlates with a specific clinical drugresponse or susceptibility in a subject (see, e.g., McLeod et al. (1999)Eur. J. Cancer 35:1650-1652). The presence or quantity of thepharmacogenomic marker is related to the predicted response of thesubject to a specific drug or class of drugs prior to administration ofthe drug. By assessing the presence or quantity of one or morepharmacogenomic markers in a subject, a drug therapy which is mostappropriate for the subject, or which is predicted to have a greaterdegree of success, can be selected. For example, based on the presenceor quantity of RNA, or protein (e.g., 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 protein or RNA) for specific tumormarkers in a subject, a drug or course of treatment can be selected thatis optimized for the treatment of the specific tumor likely to bepresent in the subject. Similarly, the presence or absence of a specificsequence mutation in 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 DNA can correlate with a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 drug response. The use ofpharmacogenomic markers therefore permits the application of the mostappropriate treatment for each subject without having to administer thetherapy.

[1015] Pharmaceutical Compositions

[1016] The nucleic acid and polypeptides, fragments thereof, as well asanti-21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593antibodies (also referred to herein as “active compounds”) of theinvention can be incorporated into pharmaceutical compositions. Suchcompositions typically include the nucleic acid molecule, protein, orantibody and a pharmaceutically acceptable carrier. As used herein thelanguage “pharmaceutically acceptable carrier” includes solvents,dispersion media, coatings, antibacterial and antifungal agents,isotonic and absorption delaying agents, and the like, compatible withpharmaceutical administration. Supplementary active compounds can alsobe incorporated into the compositions.

[1017] A pharmaceutical composition is formulated to be compatible withits intended route of administration. Examples of routes ofadministration include parenteral, e.g., intravenous, intradermal,subcutaneous, oral (e.g., inhalation), transdermal (topical),transmucosal, and rectal administration. Solutions or suspensions usedfor parenteral, intradermal, or subcutaneous application can include thefollowing components: a sterile diluent such as water for injection,saline solution, fixed oils, polyethylene glycols, glycerine, propyleneglycol or other synthetic solvents; antibacterial agents such as benzylalcohol or methyl parabens; antioxidants such as ascorbic acid or sodiumbisulfite; chelating agents such as ethylenediaminetetraacetic acid;buffers such as acetates, citrates or phosphates and agents for theadjustment of tonicity such as sodium chloride or dextrose. pH can beadjusted with acids or bases, such as hydrochloric acid or sodiumhydroxide. The parenteral preparation can be enclosed in ampoules,disposable syringes or multiple dose vials made of glass or plastic.

[1018] Pharmaceutical compositions suitable for injectable use includesterile aqueous solutions (where water soluble) or dispersions andsterile powders for the extemporaneous preparation of sterile injectablesolutions or dispersion. For intravenous administration, suitablecarriers include physiological saline, bacteriostatic water, CremophorEL™ (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In allcases, the composition must be sterile and should be fluid to the extentthat easy syringability exists. It should be stable under the conditionsof manufacture and storage and must be preserved against thecontaminating action of microorganisms such as bacteria and fungi. Thecarrier can be a solvent or dispersion medium containing, for example,water, ethanol, polyol (for example, glycerol, propylene glycol, andliquid polyetheylene glycol, and the like), and suitable mixturesthereof. The proper fluidity can be maintained, for example, by the useof a coating such as lecithin, by the maintenance of the requiredparticle size in the case of dispersion and by the use of surfactants.Prevention of the action of microorganisms can be achieved by variousantibacterial and antifungal agents, for example, parabens,chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In manycases, it will be preferable to include isotonic agents, for example,sugars, polyalcohols such as manitol, sorbitol, sodium chloride in thecomposition. Prolonged absorption of the injectable compositions can bebrought about by including in the composition an agent which delaysabsorption, for example, aluminum monostearate and gelatin.

[1019] Sterile injectable solutions can be prepared by incorporating theactive compound in the required amount in an appropriate solvent withone or a combination of ingredients enumerated above, as required,followed by filtered sterilization. Generally, dispersions are preparedby incorporating the active compound into a sterile vehicle whichcontains a basic dispersion medium and the required other ingredientsfrom those enumerated above. In the case of sterile powders for thepreparation of sterile injectable solutions, the preferred methods ofpreparation are vacuum drying and freeze-drying which yields a powder ofthe active ingredient plus any additional desired ingredient from apreviously sterile-filtered solution thereof.

[1020] Oral compositions generally include an inert diluent or an ediblecarrier. For the purpose of oral therapeutic administration, the activecompound can be incorporated with excipients and used in the form oftablets, troches, or capsules, e.g., gelatin capsules. Oral compositionscan also be prepared using a fluid carrier for use as a mouthwash.Pharmaceutically compatible binding agents, and/or adjuvant materialscan be included as part of the composition. The tablets, pills,capsules, troches and the like can contain any of the followingingredients, or compounds of a similar nature: a binder such asmicrocrystalline cellulose, gum tragacanth or gelatin; an excipient suchas starch or lactose, a disintegrating agent such as alginic acid,Primogel, or corn starch; a lubricant such as magnesium stearate orSterotes; a glidant such as colloidal silicon dioxide; a sweeteningagent such as sucrose or saccharin; or a flavoring agent such aspeppermint, methyl salicylate, or orange flavoring.

[1021] For administration by inhalation, the compounds are delivered inthe form of an aerosol spray from pressured container or dispenser whichcontains a suitable propellant, e.g., a gas such as carbon dioxide, or anebulizer.

[1022] Systemic administration can also be by transmucosal ortransdermal means. For transmucosal or transdermal administration,penetrants appropriate to the barrier to be permeated are used in theformulation. Such penetrants are generally known in the art, andinclude, for example, for transmucosal administration, detergents, bilesalts, and fusidic acid derivatives. Transmucosal administration can beaccomplished through the use of nasal sprays or suppositories. Fortransdermal administration, the active compounds are formulated intoointments, salves, gels, or creams as generally known in the art.

[1023] The compounds can also be prepared in the form of suppositories(e.g., with conventional suppository bases such as cocoa butter andother glycerides) or retention enemas for rectal delivery.

[1024] In one embodiment, the active compounds are prepared withcarriers that will protect the compound against rapid elimination fromthe body, such as a controlled release formulation, including implantsand microencapsulated delivery systems. Biodegradable, biocompatiblepolymers can be used, such as ethylene vinyl acetate, polyanhydrides,polyglycolic acid, collagen, polyorthoesters, and polylactic acid.Methods for preparation of such formulations will be apparent to thoseskilled in the art. The materials can also be obtained commercially fromAlza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions(including liposomes targeted to infected cells with monoclonalantibodies to viral antigens) can also be used as pharmaceuticallyacceptable carriers. These can be prepared according to methods known tothose skilled in the art, for example, as described in U.S. Pat. No.4,522,811.

[1025] It is advantageous to formulate oral or parenteral compositionsin dosage unit form for ease of administration and uniformity of dosage.Dosage unit form as used herein refers to physically discrete unitssuited as unitary dosages for the subject to be treated; each unitcontaining a predetermined quantity of active compound calculated toproduce the desired therapeutic effect in association with the requiredpharmaceutical carrier.

[1026] Toxicity and therapeutic efficacy of such compounds can bedetermined by standard pharmaceutical procedures in cell cultures orexperimental animals, e.g., for determining the LD₅₀ (the dose lethal to50% of the population) and the ED₅₀ (the dose therapeutically effectivein 50% of the population). The dose ratio between toxic and therapeuticeffects is the therapeutic index and it can be expressed as the ratioLD₅₀/ED₅₀. Compounds which exhibit high therapeutic indices arepreferred. While compounds that exhibit toxic side effects can be used,care should be taken to design a delivery system that targets suchcompounds to the site of affected tissue in order to minimize potentialdamage to uninfected cells and, thereby, reduce side effects.

[1027] The data obtained from the cell culture assays and animal studiescan be used in formulating a range of dosage for use in humans. Thedosage of such compounds lies preferably within a range of circulatingconcentrations that include the ED₅₀ with little or no toxicity. Thedosage can vary within this range depending upon the dosage formemployed and the route of administration utilized. For any compound usedin the method of the invention, the therapeutically effective dose canbe estimated initially from cell culture assays. A dose can beformulated in animal models to achieve a circulating plasmaconcentration range that includes the IC₅₀ (i.e., the concentration ofthe test compound which achieves a half-maximal inhibition of symptoms)as determined in cell culture. Such information can be used to moreaccurately determine useful doses in humans. Levels in plasma can bemeasured, for example, by high performance liquid chromatography.

[1028] As defined herein, a therapeutically effective amount of proteinor polypeptide (i.e., an effective dosage) ranges from about 0.001 to 30mg/kg body weight, preferably about 0.01 to 25 mg/kg body weight, morepreferably about 0.1 to 20 mg/kg body weight, and even more preferablyabout 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6mg/kg body weight. The protein or polypeptide can be administered onetime per week for between about 1 to 10 weeks, preferably between 2 to 8weeks, more preferably between about 3 to 7 weeks, and even morepreferably for about 4, 5, or 6 weeks. The skilled artisan willappreciate that certain factors can influence the dosage and timingrequired to effectively treat a subject, including but not limited tothe severity of the disease or disorder, previous treatments, thegeneral health and/or age of the subject, and other diseases present.Moreover, treatment of a subject with a therapeutically effective amountof a protein, polypeptide, or antibody, unconjugated or conjugated asdescribed herein, can include a single treatment or, preferably, caninclude a series of treatments.

[1029] For antibodies, the preferred dosage is 0.1 mg/kg of body weight(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in thebrain, a dosage of 50 mg/kg to 100 mg/kg is usually appropriate.Generally, partially human antibodies and fully human antibodies have alonger half-life within the human body than other antibodies.Accordingly, lower dosages and less frequent administration is oftenpossible. Modifications such as lipidation can be used to stabilizeantibodies and to enhance uptake and tissue penetration (e.g., into thebrain). A method for lipidation of antibodies is described by Cruikshanket al. ((1997) J. Acquired Immune Deficiency Syndromes and HumanRetrovirology 14:193).

[1030] The present invention encompasses agents which modulateexpression or activity. An agent can, for example, be a small molecule.For example, such small molecules include, but are not limited to,peptides, peptidomimetics (e.g., peptoids), amino acids, amino acidanalogs, polynucleotides, polynucleotide analogs, nucleotides,nucleotide analogs, organic or inorganic compounds (i.e., includingheteroorganic and organometallic compounds) having a molecular weightless than about 10,000 grams per mole, organic or inorganic compoundshaving a molecular weight less than about 5,000 grams per mole, organicor inorganic compounds having a molecular weight less than about 1,000grams per mole, organic or inorganic compounds having a molecular weightless than about 500 grams per mole, and salts, esters, and otherpharmaceutically acceptable forms of such compounds.

[1031] Exemplary doses include milligram or microgram amounts of thesmall molecule per kilogram of subject or sample weight (e.g., about 1microgram per kilogram to about 500 milligrams per kilogram, about 100micrograms per kilogram to about 5 milligrams per kilogram, or about 1microgram per kilogram to about 50 micrograms per kilogram. It isfurthermore understood that appropriate doses of a small molecule dependupon the potency of the small molecule with respect to the expression oractivity to be modulated. When one or more of these small molecules isto be administered to an animal (e.g., a human) in order to modulateexpression or activity of a polypeptide or nucleic acid of theinvention, a physician, veterinarian, or researcher can, for example,prescribe a relatively low dose at first, subsequently increasing thedose until an appropriate response is obtained. In addition, it isunderstood that the specific dose level for any particular animalsubject will depend upon a variety of factors including the activity ofthe specific compound employed, the age, body weight, general health,gender, and diet of the subject, the time of administration, the routeof administration, the rate of excretion, any drug combination, and thedegree of expression or activity to be modulated.

[1032] The nucleic acid molecules of the invention can be inserted intovectors and used as gene therapy vectors. Gene therapy vectors can bedelivered to a subject by, for example, intravenous injection, localadministration (see U.S. Pat. No. 5,328,470) or by stereotacticinjection (see e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA91:3054-3057). The pharmaceutical preparation of the gene therapy vectorcan include the gene therapy vector in an acceptable diluent, or cancomprise a slow release matrix in which the gene delivery vehicle isimbedded. Alternatively, where the complete gene delivery vector can beproduced intact from recombinant cells, e.g., retroviral vectors, thepharmaceutical preparation can include one or more cells which producethe gene delivery system.

[1033] The pharmaceutical compositions can be included in a container,pack, or dispenser together with instructions for administration.

[1034] Methods of Treatment:

[1035] The present invention provides for both prophylactic andtherapeutic methods of treating a subject at risk of (or susceptible to)a disorder or having a disorder associated with aberrant or unwanted21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593expression or activity. As used herein, the term “treatment” is definedas the application or administration of a therapeutic agent to apatient, or application or administration of a therapeutic agent to anisolated tissue or cell line from a patient, who has a disease, asymptom of disease or a predisposition toward a disease, with thepurpose to cure, heal, alleviate, relieve, alter, remedy, ameliorate,improve or affect the disease, the symptoms of disease or thepredisposition toward disease. A therapeutic agent includes, but is notlimited to, small molecules, peptides, antibodies, ribozymes andantisense oligonucleotides.

[1036] With regards to both prophylactic and therapeutic methods oftreatment, such treatments can be specifically tailored or modified,based on knowledge obtained from the field of pharmacogenomics.“Pharmacogenomics”, as used herein, refers to the application ofgenomics technologies such as gene sequencing, statistical genetics, andgene expression analysis to drugs in clinical development and on themarket. More specifically, the term refers the study of how a patient'sgenes determine his or her response to a drug (e.g., a patient's “drugresponse phenotype”, or “drug response genotype”.) Thus, another aspectof the invention provides methods for tailoring an individual'sprophylactic or therapeutic treatment with either the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 molecules of thepresent invention or 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 modulators according to that individual's drugresponse genotype. Pharmacogenomics allows a clinician or physician totarget prophylactic or therapeutic treatments to patients who will mostbenefit from the treatment and to avoid treatment of patients who willexperience toxic drug-related side effects.

[1037] In one aspect, the invention provides a method for preventing ina subject, a disease or condition associated with an aberrant orunwanted 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 expression or activity, by administering to the subject a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 or anagent which modulates 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 expression or at least one 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 activity. Subjects atrisk for a disease which is caused or contributed to by aberrant orunwanted 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 expression or activity can be identified by, for example, any or acombination of diagnostic or prognostic assays as described herein.Administration of a prophylactic agent can occur prior to themanifestation of symptoms characteristic of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 aberrance, such that adisease or disorder is prevented or, alternatively, delayed in itsprogression. Depending on the type of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 aberrance, for example, a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593, 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 agonist or21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593antagonist agent can be used for treating the subject. The appropriateagent can be determined based on screening assays described herein.

[1038] It is possible that some 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 disorders can be caused, at least in part, byan abnormal level of gene product, or by the presence of a gene productexhibiting abnormal activity. As such, the reduction in the level and/oractivity of such gene products would bring about the amelioration ofdisorder symptoms.

[1039] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecules can act as novel diagnostic targets and therapeutic agentsfor controlling one or more of a cellular proliferative and/ordifferentiative disorder, brain disorder, platelet disorder, breastdisorder, colon disorder, kidney (renal) disorder, lung disorder,ovarian disorder, prostate disorder, cervical disorder, spleen disorder,thymus disorder, thyroid disorder, testes disorder, hematopoeiticdisorder, pancreatic disorder, skeletal muscle disorder, skin (dermal)disorder, disorder associated with bone metabolism, immune, e.g.,inflammatory, disorder, cardiovascular disorder, endothelial celldisorder, liver disorder, viral disease, pain disorder, metabolicdisorder, neurological or CNS disorder, erythroid disorder, blood vesseldisorder or angiogenic disorder, all of which are described above.

[1040] As discussed, successful treatment of 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 disorders can be brought aboutby techniques that serve to inhibit the expression or activity of targetgene products. For example, compounds, e.g., an agent identified usingan assays described above, that proves to exhibit negative modulatoryactivity, can be used in accordance with the invention to prevent and/orameliorate symptoms of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 disorders. Such molecules can include, but are notlimited to peptides, phosphopeptides, small organic or inorganicmolecules, or antibodies (including, for example, polyclonal,monoclonal, humanized, human, anti-idiotypic, chimeric or single chainantibodies, and Fab, F(ab′)₂ and Fab expression library fragments, scFVmolecules, and epitope-binding fragments thereof).

[1041] Further, antisense and ribozyme molecules that inhibit expressionof the target gene can also be used in accordance with the invention toreduce the level of target gene expression, thus effectively reducingthe level of target gene activity. Still further, triple helix moleculescan be utilized in reducing the level of target gene activity.Antisense, ribozyme and triple helix molecules are discussed above.

[1042] It is possible that the use of antisense, ribozyme, and/or triplehelix molecules to reduce or inhibit mutant gene expression can alsoreduce or inhibit the transcription (triple helix) and/or translation(antisense, ribozyme) of mRNA produced by normal target gene alleles,such that the concentration of normal target gene product present can belower than is necessary for a normal phenotype. In such cases, nucleicacid molecules that encode and express target gene polypeptidesexhibiting normal target gene activity can be introduced into cells viagene therapy method. Alternatively, in instances in that the target geneencodes an extracellular protein, it can be preferable to co-administernormal target gene protein into the cell or tissue in order to maintainthe requisite level of cellular or tissue target gene activity.

[1043] Another method by which nucleic acid molecules can be utilized intreating or preventing a disease characterized by 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 expression is throughthe use of aptamer molecules specific for 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 protein. Aptamers are nucleicacid molecules having a tertiary structure which permits them tospecifically or selectively bind to protein ligands (see, e.g., Osborneet al. (1997) Curr. Opin. Chem Biol. 1: 5-9; and Patel (1997) Curr OpinChem Biol 1:32-46). Since nucleic acid molecules can in many cases bemore conveniently introduced into target cells than therapeutic proteinmolecules can be, aptamers offer a method by which 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein activity can bespecifically decreased without the introduction of drugs or othermolecules which can have pluripotent effects.

[1044] Antibodies can be generated that are both specific for targetgene product and that reduce target gene product activity. Suchantibodies can, therefore, by administered in instances whereby negativemodulatory techniques are appropriate for the treatment of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 disorders. For adescription of antibodies, see the Antibody section above.

[1045] In circumstances wherein injection of an animal or a humansubject with a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or epitope for stimulating antibody production isharmful to the subject, it is possible to generate an immune responseagainst 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 through the use of anti-idiotypic antibodies (see, for example,Herlyn (1999) Ann Med 31:66-78; and Bhattacharya-Chatterjee and Foon(1998) Cancer Treat Res. 94:51-68). If an anti-idiotypic antibody isintroduced into a mammal or human subject, it should stimulate theproduction of anti-anti-idiotypic antibodies, which should be specificto the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein.

[1046] Vaccines directed to a disease characterized by 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 expression canalso be generated in this fashion.

[1047] In instances where the target antigen is intracellular and wholeantibodies are used, internalizing antibodies can be preferred.Lipofectin or liposomes can be used to deliver the antibody or afragment of the Fab region that binds to the target antigen into cells.Where fragments of the antibody are used, the smallest inhibitoryfragment that binds to the target antigen is preferred. For example,peptides having an amino acid sequence corresponding to the Fv region ofthe antibody can be used. Alternatively, single chain neutralizingantibodies that bind to intracellular target antigens can also beadministered. Such single chain antibodies can be administered, forexample, by expressing nucleotide sequences encoding single-chainantibodies within the target cell population (see e.g., Marasco et al.(1993) Proc. Natl. Acad. Sci. USA 90:7889-7893).

[1048] The identified compounds that inhibit target gene expression,synthesis and/or activity can be administered to a patient attherapeutically effective doses to prevent, treat or ameliorate 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 disorders.A therapeutically effective dose refers to that amount of the compoundsufficient to result in amelioration of symptoms of the disorders.Toxicity and therapeutic efficacy of such compounds can be determined bystandard pharmaceutical procedures as described above.

[1049] The data obtained from the cell culture assays and animal studiescan be used in formulating a range of dosage for use in humans. Thedosage of such compounds lies preferably within a range of circulatingconcentrations that include the ED₅₀ with little or no toxicity. Thedosage can vary within this range depending upon the dosage formemployed and the route of administration utilized. For any compound usedin the method of the invention, the therapeutically effective dose canbe estimated initially from cell culture assays. A dose can beformulated in animal models to achieve a circulating plasmaconcentration range that includes the IC₅₀ (i.e., the concentration ofthe test compound that achieves a half-maximal inhibition of symptoms)as determined in cell culture. Such information can be used to moreaccurately determine useful doses in humans. Levels in plasma can bemeasured, for example, by high performance liquid chromatography.

[1050] Another example of determination of effective dose for anindividual is the ability to directly assay levels of “free” and “bound”compound in the serum of the test subject. Such assays can utilizeantibody mimics and/or “biosensors” that have been created throughmolecular imprinting techniques. The compound which is able to modulate21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593activity is used as a template, or “imprinting molecule”, to spatiallyorganize polymerizable monomers prior to their polymerization withcatalytic reagents. The subsequent removal of the imprinted moleculeleaves a polymer matrix which contains a repeated “negative image” ofthe compound and is able to selectively rebind the molecule underbiological assay conditions. A detailed review of this technique can beseen in Ansell et al (1996) Current Opinion in Biotechnology 7:8994 andin Shea (1994) Trends in Polymer Science 2:166-173. Such “imprinted”affinity matrixes are amenable to ligand-binding assays, whereby theimmobilized monoclonal antibody component is replaced by anappropriately imprinted matrix. An example of the use of such matrixesin this way can be seen in Vlatakis et al (1993) Nature 361:645-647.Through the use of isotope-labeling, the “free” concentration ofcompound which modulates the expression or activity of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 can be readilymonitored and used in calculations of IC₅₀.

[1051] Such “imprinted” affinity matrixes can also be designed toinclude fluorescent groups whose photon-emitting properties measurablychange upon local and selective binding of target compound. Thesechanges can be readily assayed in real time using appropriate fiberopticdevices, in turn allowing the dose in a test subject to be quicklyoptimized based on its individual IC₅₀. An rudimentary example of such a“biosensor” is discussed in Kriz et al (1995) Analytical Chemistry67:2142-2144.

[1052] Another aspect of the invention pertains to methods of modulating21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593expression or activity for therapeutic purposes. Accordingly, in anexemplary embodiment, the modulatory method of the invention involvescontacting a cell with a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 or agent that modulates one or more of theactivities of 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein activity associated with the cell. An agent thatmodulates 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein activity can be an agent as described herein, such as anucleic acid or a protein, a naturally-occurring target molecule of a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein (e.g., a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 substrate or receptor), a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 antibody, a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 agonist or antagonist, apeptidomimetic of a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 agonist or antagonist, or other small molecule.

[1053] In one embodiment, the agent stimulates one or 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activities.Examples of such stimulatory agents include active 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein and a nucleicacid molecule encoding 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593. In another embodiment, the agent inhibits one ormore 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593activities. Examples of such inhibitory agents include antisense 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleicacid molecules, anti-21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 antibodies, and 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 inhibitors. These modulatory methodscan be performed in vitro (e.g., by culturing the cell with the agent)or, alternatively, in vivo (e.g., by administering the agent to asubject). As such, the present invention provides methods of treating anindividual afflicted with a disease or disorder characterized byaberrant or unwanted expression or activity of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 protein or nucleic acidmolecule. In one embodiment, the method involves administering an agent(e.g., an agent identified by a screening assay described herein), orcombination of agents that modulates (e.g., up regulates or downregulates) 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 expression or activity. In another embodiment, the method involvesadministering a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 protein or nucleic acid molecule as therapy to compensatefor reduced, aberrant, or unwanted 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 expression or activity.

[1054] Stimulation of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 activity is desirable in situations in which 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 isabnormally downregulated and/or in which increased 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 activity is likely tohave a beneficial effect. For example, stimulation of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity isdesirable in situations in which a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 is downregulated and/or in whichincreased 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity is likely to have a beneficial effect. Likewise, inhibitionof 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593activity is desirable in situations in which 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 is abnormally upregulatedand/or in which decreased 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity is likely to have a beneficialeffect.

[1055] Pharmacogenomics

[1056] The 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecules of the present invention, as well as agents, or modulatorswhich have a stimulatory or inhibitory effect on 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 activity (e.g., 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 geneexpression) as identified by a screening assay described herein can beadministered to individuals to treat (prophylactically ortherapeutically) 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593-associated disorders (e.g., aberrant or deficient guanylatekinase activity, phophatidylinositol 4-phosphate 5-kinase activity,kinase activity, transferase activity, aminopeptidase activity,adenylate cyclase activity, calpain protease activity, oxidoreductaseactivity, neprilysin protease activity, AMP binding enzyme activity orlysyl oxidase activity) associated with aberrant or unwanted 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity.

[1057] In conjunction with such treatment, pharmacogenomics (i.e., thestudy of the relationship between an individual's genotype and thatindividual's response to a foreign compound or drug) can be considered.Differences in metabolism of therapeutics can lead to severe toxicity ortherapeutic failure by altering the relation between dose and bloodconcentration of the pharmacologically active drug. Thus, a physician orclinician can consider applying knowledge obtained in relevantpharmacogenomics studies in determining whether to administer a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 moleculeor 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593modulator as well as tailoring the dosage and/or therapeutic regimen oftreatment with a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 molecule or 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 modulator.

[1058] Pharmacogenomics deals with clinically significant hereditaryvariations in the response to drugs due to altered drug disposition andabnormal action in affected persons. See, for example, Eichelbaum et al.(1996) Clin. Exp. Pharmacol. Physiol. 23:983-985 and Linder et al.(1997) Clin. Chem. 43:254-266. In general, two types of pharmacogeneticconditions can be differentiated. Genetic conditions transmitted as asingle factor altering the way drugs act on the body (altered drugaction) or genetic conditions transmitted as single factors altering theway the body acts on drugs (altered drug metabolism). Thesepharmacogenetic conditions can occur either as rare genetic defects oras naturally-occurring polymorphisms. For example, glucose-6-phosphatedehydrogenase deficiency (G6PD) is a common inherited enzymopathy inwhich the main clinical complication is haemolysis after ingestion ofoxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans)and consumption of fava beans.

[1059] One pharmacogenomics approach to identifying genes that predictdrug response, known as “a genome-wide association”, relies primarily ona high-resolution map of the human genome consisting of already knowngene-related markers (e.g., a “bi-allelic” gene marker map whichconsists of 60,000-100,000 polymorphic or variable sites on the humangenome, each of which has two variants.) Such a high-resolution geneticmap can be compared to a map of the genome of each of a statisticallysignificant number of patients taking part in a Phase II/III drug trialto identify markers associated with a particular observed drug responseor side effect. Alternatively, such a high resolution map can begenerated from a combination of some ten-million known single nucleotidepolymorphisms (SNPs) in the human genome. As used herein, a “SNP” is acommon alteration that occurs in a single nucleotide base in a stretchof DNA. For example, a SNP can occur once per every 1000 bases of DNA. ASNP can be involved in a disease process, however, the vast majority cannot be disease-associated. Given a genetic map based on the occurrenceof such SNPs, individuals can be grouped into genetic categoriesdepending on a particular pattern of SNPs in their individual genome. Insuch a manner, treatment regimens can be tailored to groups ofgenetically similar individuals, taking into account traits that can becommon among such genetically similar individuals.

[1060] Alternatively, a method termed the “candidate gene approach”, canbe utilized to identify genes that predict drug response. According tothis method, if a gene that encodes a drug's target is known (e.g., a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593protein of the present invention), all common variants of that gene canbe fairly easily identified in the population and it can be determinedif having one version of the gene versus another is associated with aparticular drug response.

[1061] Alternatively, a method termed the “gene expression profiling”,can be utilized to identify genes that predict drug response. Forexample, the gene expression of an animal dosed with a drug (e.g., a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593molecule or 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 modulator of the present invention) can give an indication whethergene pathways related to toxicity have been turned on.

[1062] Information generated from more than one of the abovepharmacogenomics approaches can be used to determine appropriate dosageand treatment regimens for prophylactic or therapeutic treatment of anindividual. This knowledge, when applied to dosing or drug selection,can avoid adverse reactions or therapeutic failure and thus enhancetherapeutic or prophylactic efficiency when treating a subject with a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593molecule or 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 modulator, such as a modulator identified by one of the exemplaryscreening assays described herein.

[1063] The present invention further provides methods for identifyingnew agents, or combinations, that are based on identifying agents thatmodulate the activity of one or more of the gene products encoded by oneor more of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 genes of the present invention, wherein these products canbe associated with resistance of the cells to a therapeutic agent.Specifically, the activity of the proteins encoded by the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 genes of thepresent invention can be used as a basis for identifying agents forovercoming agent resistance. By blocking the activity of one or more ofthe resistance proteins, target cells, e.g., human cells, will becomesensitive to treatment with an agent to which the unmodified targetcells were resistant.

[1064] Monitoring the influence of agents (e.g., drugs) on theexpression or activity of a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 protein can be applied in clinical trials.For example, the effectiveness of an agent determined by a screeningassay as described herein to increase 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 gene expression, protein levels, orupregulate 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity, can be monitored in clinical trials of subjects exhibitingdecreased 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene expression, protein levels, or downregulated 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity.Alternatively, the effectiveness of an agent determined by a screeningassay to decrease 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 gene expression, protein levels, or downregulate 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 activity,can be monitored in clinical trials of subjects exhibiting increased21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene expression, protein levels, or upregulated 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 activity. In suchclinical trials, the expression or activity of a 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 gene, and preferably,other genes that have been implicated in, for example, a proteinkinase-associated or another 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593-associated disorder can be used as a “readout” or markers of the phenotype of a particular cell.

[1065] Other Embodiments

[1066] In another aspect, the invention features a method of analyzing aplurality of capture probes. The method is useful, e.g., to analyze geneexpression. The method includes: providing a two dimensional arrayhaving a plurality of addresses, each address of the plurality beingpositionally distinguishable from each other address of the plurality,and each address of the plurality having a unique capture probe, e.g., anucleic acid or peptide sequence, wherein the capture probes are from acell or subject which expresses 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 or from a cell or subject in which a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 mediatedresponse has been elicited; contacting the array with a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleic acid(preferably purified), a 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 polypeptide (preferably purified), or an anti-21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 antibody,and thereby evaluating the plurality of capture probes. Binding, e.g.,in the case of a nucleic acid, hybridization with a capture probe at anaddress of the plurality, is detected, e.g., by a signal generated froma label attached to the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 nucleic acid, polypeptide, or antibody.

[1067] The capture probes can be a set of nucleic acids from a selectedsample, e.g., a sample of nucleic acids derived from a control ornon-stimulated tissue or cell.

[1068] The method can include contacting the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 nucleic acid, polypeptide, orantibody with a first array having a plurality of capture probes and asecond array having a different plurality of capture probes. The resultsof each hybridization can be compared, e.g., to analyze differences inexpression between a first and second sample. The first plurality ofcapture probes can be from a control sample, e.g., a wild type, normal,or non-diseased, non-stimulated, sample, e.g., a biological fluid,tissue, or cell sample. The second plurality of capture probes can befrom an experimental sample, e.g., a mutant type, at risk, disease-stateor disorder-state, or stimulated, sample, e.g., a biological fluid,tissue, or cell sample.

[1069] The plurality of capture probes can be a plurality of nucleicacid probes each of which specifically hybridizes, with an allele of21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593.Such methods can be used to diagnose a subject, e.g., to evaluate riskfor a disease or disorder, to evaluate suitability of a selectedtreatment for a subject, to evaluate whether a subject has a disease ordisorder.

[1070] The method can be used to detect SNPs, as described above.

[1071] In another aspect, the invention features, a method of analyzing21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593,e.g., analyzing structure, function, or relatedness to other nucleicacid or amino acid sequences. The method includes: providing a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 nucleicacid or amino acid sequence; comparing the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 sequence with one or morepreferably a plurality of sequences from a collection of sequences,e.g., a nucleic acid or protein sequence database; to thereby analyze21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593.

[1072] The method can include evaluating the sequence identity between a21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593sequence and a database sequence. The method can be performed byaccessing the database at a second site, e.g., over the internet.Preferred databases include GenBank™ and SwissProt.

[1073] In another aspect, the invention features, a set ofoligonucleotides, useful, e.g., for identifying SNP's, or identifyingspecific alleles of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593. The set includes a plurality of oligonucleotides,each of which has a different nucleotide at an interrogation position,e.g., an SNP or the site of a mutation. In a preferred embodiment, theoligonucleotides of the plurality identical in sequence with one another(except for differences in length). The oligonucleotides can be providedwith differential labels, such that an oligonucleotide which hybridizesto one allele provides a signal that is distinguishable from anoligonucleotide which hybridizes to a second allele.

[1074] The sequences of 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 molecules are provided in a variety of mediums tofacilitate use thereof. A sequence can be provided as a manufacture,other than an isolated nucleic acid or amino acid molecule, whichcontains a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 molecule. Such a manufacture can provide a nucleotide or amino acidsequence, e.g., an open reading frame, in a form which allowsexamination of the manufacture using means not directly applicable toexamining the nucleotide or amino acid sequences, or a subset thereof,as they exist in nature or in purified form.

[1075] A 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleotide or amino acid sequence can be recorded on computerreadable media. As used herein, “computer readable media” refers to anymedium that can be read and accessed directly by a computer. Such mediainclude, but are not limited to: magnetic storage media, such as floppydiscs, hard disc storage medium, and magnetic tape; optical storagemedia such as compact disc and CD-ROM; electrical storage media such asRAM, ROM, EPROM, EEPROM, and the like; and general hard disks andhybrids of these categories such as magnetic/optical storage media. Themedium is adapted or configured for having thereon 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 sequence information ofthe present invention.

[1076] As used herein, the term “electronic apparatus” is intended toinclude any suitable computing or processing apparatus of other deviceconfigured or adapted for storing data or information. Examples ofelectronic apparatus suitable for use with the present invention includestand-alone computing apparatus; networks, including a local areanetwork (LAN), a wide area network (WAN) Internet, Intranet, andExtranet; electronic appliances such as personal digital assistants(PDAs), cellular phones, pagers, and the like; and local and distributedprocessing systems.

[1077] As used herein, “recorded” refers to a process for storing orencoding information on the electronic apparatus readable medium. Thoseskilled in the art can readily adopt any of the presently known methodsfor recording information on known media to generate manufacturescomprising the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 sequence information.

[1078] A variety of data storage structures are available to a skilledartisan for creating a computer readable medium having recorded thereona 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593nucleotide or amino acid sequence of the present invention. The choiceof the data storage structure will generally be based on the meanschosen to access the stored information. In addition, a variety of dataprocessor programs and formats can be used to store the nucleotidesequence information of the present invention on computer readablemedium. The sequence information can be represented in a word processingtext file, formatted in commercially-available software such asWordPerfect and Microsoft Word, or represented in the form of an ASCIIfile, stored in a database application, such as DB2, Sybase, Oracle, orthe like. The skilled artisan can readily adapt any number of dataprocessor structuring formats (e.g., text file or database) in order toobtain computer readable medium having recorded thereon the nucleotidesequence information of the present invention.

[1079] By providing the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 nucleotide or amino acid sequences of the inventionin computer readable form, the skilled artisan can routinely access thesequence information for a variety of purposes. For example, one skilledin the art can use the nucleotide or amino acid sequences of theinvention in computer readable form to compare a target sequence ortarget structural motif with the sequence information stored within thedata storage means. A search is used to identify fragments or regions ofthe sequences of the invention which match a particular target sequenceor target motif.

[1080] The present invention therefore provides a medium for holdinginstructions for performing a method for determining whether a subjecthas a guanylate kinase, phophatidylinositol 4-phosphate 5-kinase,kinase, transferase, aminopeptidase, adenylate cyclase, calpainprotease, oxidoreductase, neprilysin protease, AMP binding enzyme orlysyl oxidase-associated or another 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593-associated disease or disorder or apre-disposition to a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder,wherein the method comprises the steps of determining 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequenceinformation associated with the subject and based on the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequenceinformation, determining whether the subject has a guanylate kinase,phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder and/or recommending a particulartreatment for the disease, disorder, or pre-disease condition.

[1081] The present invention further provides in an electronic systemand/or in a network, a method for determining whether a subject has aguanylate kinase, phophatidylinositol 4-phosphate 5-kinase, kinase,transferase, aminopeptidase, adenylate cyclase, calpain protease,oxidoreductase, neprilysin protease, AMP binding enzyme or lysyloxidase-associated or another 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593-associated disease or disorder or apre-disposition to a disease associated with 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593, wherein the method comprisesthe steps of determining 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 sequence information associated with the subject,and based on the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 sequence information, determining whether the subject has aguanylate kinase, phophatidylinositol 4-phosphate 5-kinase, kinase,transferase, aminopeptidase, adenylate cyclase, calpain protease,oxidoreductase, neprilysin protease, AMP binding enzyme or lysyloxidase-associated or another 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593-associated disease or disorder or apre-disposition to a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder,and/or recommending a particular treatment for the disease, disorder, orpre-disease condition. The method may further comprise the step ofreceiving phenotypic information associated with the subject and/oracquiring from a network phenotypic information associated with thesubject.

[1082] The present invention also provides in a network, a method fordetermining whether a subject has a guanylate kinase,phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder or a pre-disposition to a guanylatekinase, phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder, said method comprising the steps ofreceiving 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 sequence information from the subject and/or information relatedthereto, receiving phenotypic information associated with the subject,acquiring information from the network corresponding to 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 and/orcorresponding to a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder,and based on one or more of the phenotypic information, the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593information (e.g., sequence information and/or information relatedthereto), and the acquired information, determining whether the subjecthas a guanylate kinase, phophatidylinositol 4-phosphate 5-kinase,kinase, transferase, aminopeptidase, adenylate cyclase, calpainprotease, oxidoreductase, neprilysin protease, AMP binding enzyme orlysyl oxidase-associated or another 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593-associated disease or disorder or apre-disposition to a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder.The method may further comprise the step of recommending a particulartreatment for the disease, disorder, or pre-disease condition.

[1083] The present invention also provides a business method fordetermining whether a subject has a guanylate kinase,phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder or a pre-disposition to a guanylatekinase, phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder, said method comprising the steps ofreceiving information related to 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 (e.g., sequence information and/orinformation related thereto), receiving phenotypic informationassociated with the subject, acquiring information from the networkrelated to 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 and/or related to a guanylate kinase, phophatidylinositol4-phosphate 5-kinase, kinase, transferase, aminopeptidase, adenylatecyclase, calpain protease, oxidoreductase, neprilysin protease, AMPbinding enzyme or lysyl oxidase-associated or another 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-associateddisease or disorder, and based on one or more of the phenotypicinformation, the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 information, and the acquired information, determiningwhether the subject has a guanylate kinase, phophatidylinositol4-phosphate 5-kinase, kinase, transferase, aminopeptidase, adenylatecyclase, calpain protease, oxidoreductase, neprilysin protease, AMPbinding enzyme or lysyl oxidase-associated or another 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593-associateddisease or disorder or a pre-disposition to a guanylate kinase,phophatidylinositol 4-phosphate 5-kinase, kinase, transferase,aminopeptidase, adenylate cyclase, calpain protease, oxidoreductase,neprilysin protease, AMP binding enzyme or lysyl oxidase-associated oranother 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593-associated disease or disorder. The method may further comprise thestep of recommending a particular treatment for the disease, disorder,or pre-disease condition.

[1084] The invention also includes an array comprising a 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequence of thepresent invention. The array can be used to assay expression of one ormore genes in the array. In one embodiment, the array can be used toassay gene expression in a tissue to ascertain tissue specificity ofgenes in the array. In this manner, up to about 7600 genes can besimultaneously assayed for expression, one of which can be 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593. This allows aprofile to be developed showing a battery of genes specificallyexpressed in one or more tissues.

[1085] In addition to such qualitative information, the invention allowsthe quantitation of gene expression. Thus, not only tissue specificity,but also the level of expression of a battery of genes in the tissue ifascertainable. Thus, genes can be grouped on the basis of their tissueexpression per se and level of expression in that tissue. This isuseful, for example, in ascertaining the relationship of gene expressionin that tissue. Thus, one tissue can be perturbed and the effect on geneexpression in a second tissue can be determined. In this context, theeffect of one cell type on another cell type in response to a biologicalstimulus can be determined. In this context, the effect of one cell typeon another cell type in response to a biological stimulus can bedetermined. Such a determination is useful, for example, to know theeffect of cell-cell interaction at the level of gene expression. If anagent is administered therapeutically to treat one cell type but has anundesirable effect on another cell type, the invention provides an assayto determine the molecular basis of the undesirable effect and thusprovides the opportunity to co-administer a counteracting agent orotherwise treat the undesired effect. Similarly, even within a singlecell type, undesirable biological effects can be determined at themolecular level. Thus, the effects of an agent on expression of otherthan the target gene can be ascertained and counteracted.

[1086] In another embodiment, the array can be used to monitor the timecourse of expression of one or more genes in the array. This can occurin various biological contexts, as disclosed herein, for exampledevelopment of a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder,progression of a guanylate kinase, phophatidylinositol 4-phosphate5-kinase, kinase, transferase, aminopeptidase, adenylate cyclase,calpain protease, oxidoreductase, neprilysin protease, AMP bindingenzyme or lysyl oxidase-associated or another 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593-associated disease or disorder,and processes, such a cellular transformation associated with theguanylate kinase, phophatidylinositol 4-phosphate 5-kinase, kinase,transferase, aminopeptidase, adenylate cyclase, calpain protease,oxidoreductase, neprilysin protease, AMP binding enzyme or lysyloxidase-associated or another 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593-associated disease or disorder.

[1087] The array is also useful for ascertaining the effect of theexpression of a gene on the expression of other genes in the same cellor in different cells (e.g., acertaining the effect of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 expression on theexpression of other genes). This provides, for example, for a selectionof alternate molecular targets for therapeutic intervention if theultimate or downstream target cannot be regulated.

[1088] The array is also useful for ascertaining differential expressionpatterns of one or more genes in normal and abnormal cells. Thisprovides a battery of genes (e.g., including 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593) that could serve as amolecular target for diagnosis or therapeutic intervention.

[1089] As used herein, a “target sequence” can be any DNA or amino acidsequence of six or more nucleotides or two or more amino acids. Askilled artisan can readily recognize that the longer a target sequenceis, the less likely a target sequence will be present as a randomoccurrence in the database. Typical sequence lengths of a targetsequence are from about 10 to 100 amino acids or from about 30 to 300nucleotide residues. However, it is well recognized that commerciallyimportant fragments, such as sequence fragments involved in geneexpression and protein processing, may be of shorter length.

[1090] Computer software is publicly available which allows a skilledartisan to access sequence information provided in a computer readablemedium for analysis and comparison to other sequences. A variety ofknown algorithms are disclosed publicly and a variety of commerciallyavailable software for conducting search means are and can be used inthe computer-based systems of the present invention. Examples of suchsoftware include, but are not limited to, MacPattern (EMBL), BLASTN andBLASTX (NCBI).

[1091] Thus, the invention features a method of making a computerreadable record of a sequence of a 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 sequence which includes recording thesequence on a computer readable matrix. In a preferred embodiment therecord includes one or more of the following: identification of an ORF;identification of a domain, region, or site; identification of the startof transcription; identification of the transcription terminator; thefull length amino acid sequence of the protein, or a mature formthereof; the 5′ end of the translated region.

[1092] In another aspect, the invention features a method of analyzing asequence. The method includes: providing a 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 sequence, or record, incomputer readable form; comparing a second sequence to the 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 sequence; therebyanalyzing a sequence. Comparison can include comparing to sequences forsequence identity or determining if one sequence is included within theother, e.g., determining if the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 sequence includes a sequence being compared.In a preferred embodiment the 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 or second sequence is stored on a firstcomputer, e.g., at a first site and the comparison is performed, read,or recorded on a second computer, e.g., at a second site. E.g., the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 orsecond sequence can be stored in a public or proprietary database in onecomputer, and the results of the comparison performed, read, or recordedon a second computer. In a preferred embodiment the record includes oneor more of the following: identification of an ORF; identification of adomain, region, or site; identification of the start of transcription;identification of the transcription terminator; the full length aminoacid sequence of the protein, or a mature form thereof; the 5′ end ofthe translated region.

EXEMPLIFICATION Example 1 Tissue Distribution of 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA

[1093] Northern blot hybridizations with various RNA samples can beperformed under standard conditions and washed under stringentconditions, i.e., 0.2×SSC at 65° C. A DNA probe corresponding to all ora portion of the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 cDNA (SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26,31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68,71, 73, 88, 90, 104, 106, 107, 109, 111 or 113) or 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 cDNA can be used. TheDNA was radioactively labeled with ³²P-dCTP using the Prime-It Kit(Stratagene, La Jolla, Calif.) according to the instructions of thesupplier. Filters containing mRNA from mouse hematopoietic and endocrinetissues, and cancer cell lines (Clontech, Palo Alto, Calif.) can beprobed in ExpressHyb hybridization solution (Clontech) and washed athigh stringency according to manufacturer's recommendations.

Example 2 Recombinant Expression of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 in Bacterial Cells

[1094] In this example, 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 is expressed as a recombinantglutathione-S-transferase (GST) fusion polypeptide in E. coli and thefusion polypeptide is isolated and characterized. Specifically, 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 is fusedto GST and this fusion polypeptide is expressed in E. coli, e.g., strainPEB199. Expression of the GST-21910, -56634, -55053, -2504, -15977,-14760, -25501, -17903, -3700, -21529, -26176, -26343, -56638, -18610,-33217, -21967, -h1983, -m1983, -38555 or -593 fusion protein in PEB199is induced with IPTG. The recombinant fusion polypeptide is purifiedfrom crude bacterial lysates of the induced PEB199 strain by affinitychromatography on glutathione beads. Using polyacrylamide gelelectrophoretic analysis of the polypeptide purified from the bacteriallysates, the molecular weight of the resultant fusion polypeptide isdetermined.

Example 3 Expression of Recombinant 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 Protein in COS Cells

[1095] To express the 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 gene in COS cells, the pcDNA/Amp vector byInvitrogen Corporation (San Diego, Calif.) is used. This vector containsan SV40 origin of replication, an ampicillin resistance gene, an E. colireplication origin, a CMV promoter followed by a polylinker region, andan SV40 intron and polyadenylation site. A DNA fragment encoding theentire 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 protein and an HA tag (Wilson et al. (1984) Cell 37:767) or a FLAGtag fused in-frame to its 3′ end of the fragment is cloned into thepolylinker region of the vector, thereby placing the expression of therecombinant protein under the control of the CMV promoter.

[1096] To construct the plasmid, the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 DNA sequence is amplified by PCR usingtwo primers. The 5′ primer contains the restriction site of interestfollowed by approximately twenty nucleotides of the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 coding sequence startingfrom the initiation codon; the 3′ end sequence contains complementarysequences to the other restriction site of interest, a translation stopcodon, the HA tag or FLAG tag and the last 20 nucleotides of the 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 codingsequence. The PCR amplified fragment and the pcDNA/Amp vector aredigested with the appropriate} restriction enzymes and the vector isdephosphorylated using the CIAP enzyme (New England Biolabs, Beverly,Mass.). Preferably the two restriction sites chosen are different sothat the 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene is inserted in the correct orientation. The ligation mixture istransformed into E. coli cells (strains HB101, DH5α, SURE, availablefrom Stratagene Cloning Systems, La Jolla, Calif., can be used), thetransformed culture is plated on ampicillin media plates, and resistantcolonies are selected. Plasmid DNA is isolated from transformants andexamined by restriction analysis for the presence of the correctfragment.

[1097] COS cells are subsequently transfected with the 21910-, 56634-,55053-, 2504-, 15977-, 14760-, 25501-, 17903-, 3700-, 21529-, 26176-,26343-, 56638-, 18610-, 33217-, 21967-, h1983-, m1983-, 38555- or593-pcDNA/Amp plasmid DNA using the calcium phosphate or calciumchloride co-precipitation methods, DEAE-dextran-mediated transfection,lipofection, or electroporation. Other suitable methods for transfectinghost cells can be found in Sambrook, J., Fritsh, E. F., and Maniatis, T.Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring HarborLaboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,N.Y., 1989. The expression of the 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 polypeptide is detected byradiolabelling (³⁵S-methionine or ³⁵S-cysteine available from NEN,Boston, Mass., can be used) and immunoprecipitation (Harlow, E. andLane, D. Antibodies: A Laboratory Manual, Cold Spring Harbor LaboratoryPress, Cold Spring Harbor, N.Y., 1988) using an HA specific monoclonalantibody. Briefly, the cells are labeled for 8 hours with ³⁵S-methionine(or ³⁵S-cysteine). The culture media are then collected and the cellsare lysed using detergents (RIPA-buffer, 150 mM NaCl, 1% NP-40, 0.1%SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the culturemedia are precipitated with an HA specific monoclonal antibody.Precipitated polypeptides are then analyzed by SDS-PAGE.

[1098] Alternatively, DNA containing the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 coding sequence is cloneddirectly into the polylinker of the pcDNA/Amp vector using theappropriate restriction sites. The resulting plasmid is transfected intoCOS cells in the manner described above, and the expression of the21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide is detected by radiolabelling and immunoprecipitation usinga 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593specific monoclonal antibody.

Exmaple 4 TaqMan Analysis of 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593

[1099] Human 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 expression was measured by TaqMan® quantitative PCR (PerkinElmer Applied Biosystems) in cDNA prepared from a variety of normal anddiseased (e.g., cancerous) human tissues or cell lines.

[1100] Probes were designed by PrimerExpress software (PE Biosystems)based on the sequence of the human 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 gene. Each human 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 gene probe was labeledusing FAM (6-carboxyfluorescein), and the P2-microglobulin referenceprobe was labeled with a different fluorescent dye, VIC. Thedifferential labeling of the target gene and internal reference genethus enabled measurement in same well. Forward and reverse primers andthe probes for both β2-microglobulin and target gene were added to theTaqMan® Universal PCR Master Mix (PE Applied Biosystems). Although thefinal concentration of primer and probe could vary, each was internallyconsistent within a given experiment. A typical experiment contained 200nM of forward and reverse primers plus 100 nM probe for β-2microglobulin and 600 nM forward and reverse primers plus 200 nM probefor the target gene. TaqMan matrix experiments were carried out on anABI PRISM 7700 Sequence Detection System (PE Applied Biosystems). Thethermal cycler conditions were as follows: hold for 2 min at 50° C. and10 min at 95° C., followed by two-step PCR for 40 cycles of 95° C. for15 sec followed by 60° C. for 1 min.

[1101] The following method was used to quantitatively calculate human21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593gene expression in the various tissues relative to β-2 microglobulinexpression in the same tissue. The threshold cycle (Ct) value is definedas the cycle at which a statistically significant increase influorescence is detected. A lower Ct value is indicative of a highermRNA concentration. The Ct value of the human 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 gene is normalized bysubtracting the Ct value of the β-2 microglobulin gene to obtain a_(Δ)Ct value using the following formula: _(ΔCt=Ct)_(sample)−Ct_(β-2 microglobulin). Expression is then calibrated againsta cDNA sample showing a comparatively low level of expression of thehuman 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 gene. The _(Δ)Ct value for the calibrator sample is then subtractedfrom _(Δ)Ct for each tissue sample according to the following formula:_(ΔΔ)Ct=_(Δ)Ct−_(sample)−_(Δ)Ct−_(calibrator). Relative expression isthen calculated using the arithmetic formula given by 2^(−ΔΔCt).

Example 5 In Situ Hybridization of 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593

[1102] The following describes the tissue distribution of 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 mRNA, as may bedetermined by in situ hybridization analysis using oligonucleotideprobes based on the human G2RF sequence.

[1103] For in situ analysis, various tissues, e.g. tissues obtained frombrain, are first frozen on dry ice. Ten-micrometer-thick sections of thetissues are postfixed with 4% formaldehyde in DEPC treated 1×phosphate-buffered saline at room temperature for 10 minutes beforebeing rinsed twice in DEPC 1× phosphate-buffered saline and once in 0.1M triethanolamine-HCl (pH 8.0). Following incubation in 0.25% aceticanhydride-0.1 M triethanolamine-HCl for 10 minutes, sections are rinsedin DEPC 2×SSC (1×SSC is 0.15 μM NaCl plus 0.015M sodium citrate). Tissueis then dehydrated through a series of ethanol washes, incubated in 100%chloroform for 5 minutes, and then rinsed in 100% ethanol for 1 minuteand 95% ethanol for 1 minute and allowed to air dry.

[1104] Hybridizations are performed with ³⁵S-radiolabeled (5×10 cpm/ml)cRNA probes. Probes are incubated in the presence of a solutioncontaining 600 mM NaCl, 10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% shearedsalmon sperm DNA, 0.01% yeast tRNA, 0.05% yeast total RNA type X1, 1×Denhardt's solution, 50% formamide, 10% dextran sulfate, 100 mMdithiothreitol, 0.1% sodium dodecyl sulfate (SDS), and 0.1% sodiumthiosulfate for 18 hours at 55° C.

[1105] After hybridization, slides are washed with 2×SSC. Sections arethen sequentially incubated at 37° C. in TNE (a solution containing 10mM Tris-HCl (pH 7.6), 500 mM NaCl, and 1 mM EDTA), for 10 minutes, inTNE with 110 μg of RNase A per ml for 30 minutes, and finally in TNE for10 minutes. Slides are then rinsed with 2×SSC at room temperature,washed with 2×SSC at 50° C. for 1 hour, washed with 0.2×SSC at 55° C.for 1 hour, and 0.2×SSC at 60° C. for 1 hour. Sections are thendehydrated rapidly through serial ethanol-0.3 M sodium acetateconcentrations before being air dried and exposed to Kodak Biomax MRscientific imaging film for 24 hours and subsequently dipped in NB-2photoemulsion and exposed at 4° C. for 7 days before being developed andcounter stained.

[1106] The contents of all references, patents and published patentapplications cited throughout this application are incorporated hereinby reference.

[1107] Equivalents

[1108] Those skilled in the art will recognize, or be able to ascertainusing no more than routine experimentation, many equivalents to thespecific embodiments of the invention described herein.

0 SEQUENCE LISTING The patent application contains a lengthy “SequenceListing” section. A copy of the “Sequence Listing” is available inelectronic form from the USPTO web site(http://seqdata.uspto.gov/sequence.html?DocID=20040058355). Anelectronic copy of the “Sequence Listing” will also be available fromthe USPTO upon request and payment of the fee set forth in 37 CFR1.19(b)(3).

What is claimed is:
 1. An isolated 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 nucleic acid molecule selected fromthe group consisting of: a) a nucleic acid molecule comprising anucleotide sequence which is at least 60% identical to the nucleotidesequence of SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31,33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71,73, 88, 90, 104, 106, 107, 109, 111 or 113, or the nucleotide sequenceof the DNA insert of the plasmid deposited with ATCC Accession Number______; b) a nucleic acid molecule comprising a fragment of at least 15nucleotides of the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 10, 12,18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56,57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113,or the nucleotide sequence of the DNA insert of the plasmid depositedwith ATCC Accession Number ______; c) a nucleic acid molecule whichencodes a polypeptide comprising the amino acid sequence of SEQ ID NO:2,6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108or 112, or the amino acid sequence encoded by the cDNA insert of theplasmid deposited with the ATCC Accession Number ______; d) a nucleicacid molecule which encodes a fragment of a polypeptide comprising theamino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47,50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, or the amino acid sequenceencoded by the cDNA insert of the plasmid deposited with the ATCCAccession Number ______, wherein the fragment comprises at least 15contiguous amino acids of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44,47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, or the amino acidsequence encoded by the cDNA insert of the plasmid deposited with theATCC Accession Number ______; e) a nucleic acid molecule which encodes anaturally occurring allelic variant of a polypeptide comprising theamino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47,50, 55, 58, 64, 67, 72, 89, 105, 108 or 112 or the amino acid sequenceencoded by the cDNA insert of the plasmid deposited with the ATCCAccession Number ______, wherein the nucleic acid molecule hybridizes toa nucleic acid molecule comprising SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20,21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59,63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113, or acomplement thereof, under stringent conditions; f) a nucleic acidmolecule comprising the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 10,12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54,56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or113, or the nucleotide sequence of the DNA insert of the plasmiddeposited with ATCC Accession Number ______; and g) a nucleic acidmolecule which encodes a polypeptide comprising the amino acid sequenceof SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67,72, 89, 105, 108 or 112, or the amino acid sequence encoded by the cDNAinsert of the plasmid deposited with the ATCC Accession Number ______.2. The isolated nucleic acid molecule of claim 1, which is thenucleotide sequence SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39, 43, 46, 49,54, 57, 63, 66, 71, 88, 104, 107 or
 111. 3. A host cell which containsthe nucleic acid molecule of claim
 1. 4. An isolated 21910, 56634,55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343,56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptideselected from the group consisting of: a) a polypeptide which is encodedby a nucleic acid molecule comprising a nucleotide sequence which is atleast 60% identical to a nucleic acid comprising the nucleotide sequenceof SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41,43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90,104, 106, 107, 109, 111 or 113, or the nucleotide sequence of the DNAinsert of the plasmid deposited with ATCC Accession Number ______, or acomplement thereof; b) a naturally occurring allelic variant of apolypeptide comprising the amino acid sequence of SEQ ID NO:2, 6, 11,19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112,or the amino acid sequence encoded by the cDNA insert of the plasmiddeposited with the ATCC Accession Number ______, wherein the polypeptideis encoded by a nucleic acid molecule which hybridizes to a nucleic acidmolecule comprising SEQ ID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24,26, 31, 33, 39, 41, 43, 45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66,68, 71, 73, 88, 90, 104, 106, 107, 109, 111 or 113, or a complementthereof under stringent conditions; c) a fragment of a polypeptidecomprising the amino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25,32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, or theamino acid sequence encoded by the cDNA insert of the plasmid depositedwith the ATCC Accession Number ______, wherein the fragment comprises atleast 15 contiguous amino acids of SEQ ID NO:2, 6, 11, 19, 22, 25, 32,40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112; and d) theamino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47,50, 55, 58, 64, 67, 72, 89, 105, 108 or
 112. 5. An antibody whichselectively binds to a polypeptide of claim
 4. 6. The polypeptide ofclaim 4, further comprising heterologous amino acid sequences.
 7. Amethod for producing a polypeptide selected from the group consistingof: a) a polypeptide comprising the amino acid sequence of SEQ ID NO:2,6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108or 112, or the amino acid sequence encoded by the cDNA insert of theplasmid deposited with the ATCC Accession Number ______; b) apolypeptide comprising a fragment of the amino acid sequence of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112, or the amino acid sequence encoded by the cDNA insertof the plasmid deposited with the ATCC Accession Number ______, whereinthe fragment comprises at least 15 contiguous amino acids of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112, or the amino acid sequence encoded by the cDNA insertof the plasmid deposited with the ATCC Accession Number ______; c) anaturally occurring allelic variant of a polypeptide comprising theamino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47,50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, or the amino acid sequenceencoded by the cDNA insert of the plasmid deposited with the ATCCAccession Number ______, wherein the polypeptide is encoded by a nucleicacid molecule which hybridizes to a nucleic acid molecule comprising SEQID NO:1, 3, 5, 7, 10, 12, 18, 20, 21, 23, 24, 26, 31, 33, 39, 41, 43,45, 46, 48, 49, 51, 54, 56, 57, 59, 63, 65, 66, 68, 71, 73, 88, 90, 104,106, 107, 109, 111 or 113; and d) the amino acid sequence of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112; comprising culturing the host cell of claim 3 underconditions in which the nucleic acid molecule is expressed.
 8. A methodfor detecting the presence of a nucleic acid molecule of claim 1 or apolypeptide encoded by the nucleic acid molecule in a sample,comprising: a) contacting the sample with a compound which selectivelyhybridizes to the nucleic acid molecule of claim 1 or binds to thepolypeptide encoded by the nucleic acid molecule; and b) determiningwhether the compound hybridizes to the nucleic acid or binds to thepolypeptide in the sample.
 9. A kit comprising a compound whichselectively hybridizes to a nucleic acid molecule of claim 1 or binds toa polypeptide encoded by the nucleic acid molecule and instructions foruse.
 10. A method for identifying a compound which binds to apolypeptide or modulates the activity of the polypeptide of claim 4comprising the steps of: a) contacting a polypeptide, or a cellexpressing a polypeptide of claim 4 with a test compound; and b)determining whether the polypeptide binds to the test compound ordetermining the effect of the test compound on the activity of thepolypeptide.
 11. A method for modulating the activity of a polypeptideof claim 4 comprising contacting the polypeptide or a cell expressingthe polypeptide with a compound which binds to the polypeptide in asufficient concentration to modulate the activity of the polypeptide.12. A method for identifying a compound capable of treating a disordercharacterized by aberrant 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity, comprising assaying the ability ofthe compound to modulate 21910, 56634, 55053, 2504, 15977, 14760, 25501,17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983,m1983, 38555 or 593 nucleic acid expression or 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 polypeptide activity,thereby identifying a compound capable of treating a disordercharacterized by aberrant 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity.
 13. A method of identifying anucleic acid molecule associated with a disorder characterized byaberrant 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity, comprising: a) contacting a sample from a subject with adisorder characterized by aberrant 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 activity, comprising nucleic acidmolecules with a hybridization probe comprising at least 25 contiguousnucleotides of SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39, 43, 46, 49, 54,57, 63, 66, 71, 88, 104, 107 or 111 defined in claim 2; and b) detectingthe presence of a nucleic acid molecule in the sample that hybridizes tothe probe, thereby identifying a nucleic acid molecule associated with adisorder characterized by aberrant 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 activity.
 14. A method of identifyinga polypeptide associated with a disorder characterized by aberrant21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529,26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593activity, comprising: a) contacting a sample comprising polypeptideswith a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 polypeptide defined in claim 4; and b) detecting the presence of apolypeptide in the sample that binds to the 21910, 56634, 55053, 2504,15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610,33217, 21967, h1983, m1983, 38555 or 593 binding partner, therebyidentifying the polypeptide associated with a disorder characterized byaberrant 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity.
 15. A method of identifying a subject having a disordercharacterized by aberrant 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity, comprising: a) contacting a sampleobtained from the subject comprising nucleic acid molecules with ahybridization probe comprising at least 25 contiguous nucleotides of SEQID NO:1, 5, 10, 18, 21, 24, 31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88,104, 107 or 111 defined in claim 2; and b) detecting the presence of anucleic acid molecule in the sample that hybridizes to the probe,thereby identifying a subject having a disorder characterized byaberrant 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 activity.
 16. A method for treating a subject having a disordercharacterized by aberrant 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 activity, or a subject at risk of developinga disorder characterized by aberrant 21910, 56634, 55053, 2504, 15977,14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217,21967, h1983, m1983, 38555 or 593 activity, comprising administering tothe subject a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903,3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983,38555 or 593 modulator of the nucleic acid molecule defined in claim 1or the polypeptide encoded by the nucleic acid molecule or contacting acell with a 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 modulator.
 17. The method defined in claim 16 wherein said disorderis a cellular proliferative and/or differentiative disorders, braindisorders, platelet disorders, breast disorders, colon disorders, kidney(renal) disorders, lung disorders, ovarian disorders, prostatedisorders, cervical disorders, spleen disorders, thymus disorders,thyroid disorders, testes disorders, hematopoeitic disorders, pancreaticdisorders, skeletal muscle disorders, skin (dermal) disorders, disordersassociated with bone metabolism, immune, e.g., inflammatory, disorders,cardiovascular disorders, endothelial cell disorders, liver disorders,viral diseases, pain disorders, metabolic disorders, neurological or CNSdisorders, erythroid disorders, blood vessel disorders or angiogenicdisorders.
 18. The method of claim 16, wherein the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 modulator is a) a smallmolecule; b) peptide; c) phosphopeptide; d) anti-21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 antibody; e) a 21910,56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176,26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or 593polypeptide comprising the amino acid sequence of SEQ ID NO:2, 6, 11,19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112,or a fragment thereof; f) a 21910, 56634, 55053, 2504, 15977, 14760,25501, 17903, 3700, 21529, 26176, 26343, 56638, 18610, 33217, 21967,h1983, m1983, 38555 or 593 polypeptide comprising an amino acid sequencewhich is at least 90 percent identical to the amino acid sequence of SEQID NO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112, wherein the percent identity is calculated using theALIGN program for comparing amino acid sequences, a PAM120 weightresidue table, a gap length penalty of 12, and a gap penalty of 4; or g)an isolated naturally occurring allelic variant of a polypeptideconsisting of the amino acid sequence of SEQ ID NO:2, 6, 11, 19, 22, 25,32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112, wherein thepolypeptide is encoded by a nucleic acid molecule which hybridizes to acomplement of a nucleic acid molecule consisting of SEQ ID NO:1, 5, 10,18, 21, 24, 31, 39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111at 6×SSC at 45° C., followed by one or more washes in 0.2×SSC, 0.1% SDSat 65° C.
 19. The method of claim 16, wherein the 21910, 56634, 55053,2504, 15977, 14760, 25501, 17903, 3700, 21529, 26176, 26343, 56638,18610, 33217, 21967, h1983, m1983, 38555 or 593 modulator is a) anantisense 21910, 56634, 55053, 2504, 15977, 14760, 25501, 17903, 3700,21529, 26176, 26343, 56638, 18610, 33217, 21967, h1983, m1983, 38555 or593 nucleic acid molecule; b) is a ribozyme; c) the nucleotide sequenceof SEQ ID NO:1, 5, 10, 18, 21, 24, 31, 39, 43, 46, 49, 54, 57, 63, 66,71, 88, 104, 107 or 111 or a fragment thereof; d) a nucleic acidmolecule encoding a polypeptide comprising an amino acid sequence whichis at least 90 percent identical to the amino acid sequence of SEQ IDNO:2, 6, 11, 19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89,105, 108 or 112, wherein the percent identity is calculated using theALIGN program for comparing amino acid sequences, a PAM120 weightresidue table, a gap length penalty of 12, and a gap penalty of 4; e) anucleic acid molecule encoding a naturally occurring allelic variant ofa polypeptide comprising the amino acid sequence of SEQ ID NO:2, 6, 11,19, 22, 25, 32, 40, 44, 47, 50, 55, 58, 64, 67, 72, 89, 105, 108 or 112,wherein the nucleic acid molecule which hybridizes to a complement of anucleic acid molecule consisting of SEQ ID NO:1, 5, 10, 18, 21, 24, 31,39, 43, 46, 49, 54, 57, 63, 66, 71, 88, 104, 107 or 111 at 6×SSC at 45°C., followed by one or more washes in 0.2×SSC, 0.1% SDS at 65° C.; or f)a gene therapy vector.